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SWITCHED NETWORKS WITH MAXIMUM WEIGHT POLICIES: 
FLUID APPROXIMATION AND MULTIPLICATIVE STATE SPACE 

COLLAPSE 

By Devavrat Shah^ and Damon Wischik^ 

Massachusetts Institute of Technology and University College London 

We consider a queueing network in which there are constraints 
on which queues may be served simultaneously; such networks may 
be used to model input-queued switches and wireless networks. The 
scheduling policy for such a network specifies which queues to serve at 
any point in time. We consider a family of scheduling policies, related 
to the maximum-weight policy of Tassiulas and Ephremides [IEEE 
Trans. Automat. Control 37 (1992) 1936-1948], for single-hop and 
multihop networks. We specify a fluid model and show that fluid- 
scaled performance processes can be approximated by fluid model 
solutions. We study the behavior of fluid model solutions under criti- 
cal load, and characterize invariant states as those states which solve 
a certain network-wide optimization problem. We use fluid model re- 
sults to prove multiplicative state space collapse. A notable feature 
of our results is that they do not assume complete resource pooling. 

1. Introduction. A switched network consists of a collection of queues, 
operating in discrete time. In each time slot, queues are offered service ac- 
cording to a service schedule chosen from a specified finite set. For exam- 
ple, in a three-queue system, the set of allowed schedules might consist of 
"Serve 3 units of work each from queues A and B" and "Serve 1 unit of 
work each from queues A and C, and 2 units from queue i?." The rule for 
choosing a schedule is called the scheduling policy. New work may arrive in 
each time slot; let each queue have a dedicated exogenous arrival process, 
with specified mean arrival rates. Once work is served, it may either rejoin 
one of the queues or leave the network. 
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Switched networks are special cases of what Harrison [12, 13] calls "stochas- 
tic processing networks." We believe that switched networks are general 
enough to model a variety of interesting applications. For example, they 
have been used to model input-queued switches, the devices at the heart 
of high-end internet routers, whose underlying silicon architecture imposes 
constraints on which traffic streams can be transmitted simultaneously [8]. 
They have also been used to model a multi-hop wireless network in which 
interference limits the amount of service that can be given to each host [28] . 

The main result of this paper is Theorem 7.1, which proves multiplicative 
state space collapse (as defined in Bramson [3]) for a switched network run- 
ning a generalized version of the maximum-weight scheduling policy of Tas- 
siulas and Ephremides [28], in the diffusion (or heavy traffic) limit. Whereas 
previous works on switched networks and stochastic processing networks in 
the diffusion limit [6, 7, 27] have assumed the "complete resource pooling" 
condition, which roughly means that there is a single bottleneck cut con- 
straint, we do not make this assumption. Section 3 discusses further the 
related work and our reasons for being interested in the case without com- 
plete resource pooling. 

To prove multiplicative state space collapse we follow the general method 
laid out by Bramson [3]. In Section 2 we specify a stochastic switched net- 
work model and describe the generalized maximum-weight policy. In Sec- 
tion 4 we specify a fluid model and prove that fluid-scaled performance 
processes of the switched network are approximated by solutions of this 
fluid model. Sections 5 and 6 give properties of the solutions of the fluid 
model for single-hop and multi-hop networks, respectively. Specifically, for 
nonoverloaded fluid model solutions, we characterize the invariant states 
and prove that fluid model solutions converge towards the set of invariant 
states. In Section 7 we use these properties to prove multiplicative state 
space collapse. 

We use the cluster-point method of Bramson [3] to prove the fluid model 
approximation in Section 4, rather than following an approach based on 
weak convergence. The former yields uniform bounds on the error of fluid 
model approximations, and these uniform bounds are needed in proving 
multiplicative state space collapse. However, the assumptions we make on 
the arrival process are not the same as those of Bramson [3] . 

In Section 8 we give results concerning the fluid model behavior of a gen- 
eral single- hop switched network in critical load, and the set of invariant 
states for the input-queued switch, under a condition that we call "complete 
loading." Motivated by these results, we define a scheduling policy which we 
conjecture is optimal in the diffusion limit. 

Notation. Let N be the set of natural numbers {1, 2, . . .}, let Z+ = {0, 1, 
2, . . .}, let M be the set of real numbers and let M+ = {x E M : j; > 0}. Let 1|.} 
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be the indicator function, where Itrue = 1 and Ifaise = 0. Let x Ay = min(x, y), 
x\/y = max(j;, y) and [x]"*" = xVO. When x is a vector, the maximum is taken 
comp onentwise . 

We win reserve bold letters for vectors in M^, where N is the number 
of queues, for example, x= [x„]i<„<Ar. Superscripts on vectors are used 
to denote labels, not exponents, except where otherwise noted; thus, for 
example, (x'^,x^,x^) refers to three arbitrary vectors. Let be the vector 
of all Os, and 1 be the vector of all Is. Use the norm |x| =max„|xn|. For 
vectors u and v and functions / : R — t- M, let 
N 

U-V='^UnVn, UV = ['U„Z;„]i<„<Ar and f (u) = [f {Un)]l<n<N , 

n=l 

and let matrix multiplication take precedence over dot product so that 

N / N \ 
U • = ^ n„ ^ AnmVm ■ 
n=l \r?i=l / 

Let be the transpose of matrix A. For a set S C M^, denote its convex 
huh by (S). 

For a fixed T > 0, and / G N, let C^{T) be the set of continuous functions 
[0 , T] — 7- , where is equipped with the norm | x | = maxj | x j | . Equip (T) 
with the norm 

11/11= sup 

iG[0,T] 

Let d{f,g) = ||/ — be the metric induced by this norm. For E C C^{T) 
and / G C^{T), let d{f,E) = mf{d{f,g) ig G E}. Define the modulus of con- 
tinuity mc5(-) by 

mc5(/)= sup \f(s)-f{t)\, 

\s~t\<S 

where G [0,T]. Since [0,T] is compact, each / G (T) is uniformly con- 
tinuous, therefore mcs{f) — )• as (5 — )■ 0. 

2. Switched network model. We now introduce the switched network 
model. Section 2.1 describes the general system model. Section 2.2 specifies 
the class of scheduling policies we are interested in and Section 2.3 lists the 
probabilistic assumptions about the arrival process that are needed for the 
main theorems. 

2.1. Queueing dynamics. Consider a collection of queues. Let time be 
discrete, indexed by r G {0, 1, . . .}. Let Qn{T) be the amount of work in queue 
n G {1, . . . , A^} at time slot r. Following our general notation for vectors, we 
write Q(t) for [Qn (''")] i<n<7V- The initial queue sizes are Q(0). Let An{T) be 
the total amount of work arriving to queue n, and Bn{T) be the cumulative 
potential service to queue n, up to time r, with A(0) = B(0) = 0. 
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We first define the queueing dynamics for a single-hop switched network. 
Defining dA(r) = A(r + 1) - A(r) and (iB(r) = B(r + 1) - B(r), the basic 
Lindley recursion that we will consider is 



(1) Q(r + 1) = [Q(r) - dB(r)]+ + dA(r), 

where the [•]"'' is taken componentwise. The fundamental "switched network' 
constraint is that there is some finite set S C such that 

(2) dB(r) G S for ah r. 



We will refer to tt G 5 as a schedule and S as the set of allowed schedules. 
In the applications in this paper, the schedule is chosen based on current 
queue sizes, which is why it is natural to write the basic Lindley recursion 
as (1) rather than the more standard [Q(t) + dA(r) — dB{T)]~^ . 

For the analyses in this paper it is useful to keep track of two other 
quantities. Let 5^,(t) be the cumulative amount of idling at queue n, defined 
by Y(0) = and 

(3) c?Y(r) = [dB(r)-Q(r)]+, 

where (iY(r) = Y(r + 1) — Y(r). Then (1) can be rewritten 

(4) Q(r) = Q(0) + A(r) - B(r) + Y(r). 

Also, let S-jriT) be the cumulative time spent on schedule tt up to time r, 
so that 

(5) B{T) = Y,SAr)7v. 

7rG5 

For a multi-hop switched network, let R G {0, l}^^^ be the routing ma- 
trix, Rmn = 1 if work served from queue m is sent to queue n and Rmn = 
otherwise; if Rmn = for all n, then work served from queue m departs the 
network. For each m we require Rmn = 1 for at most one n. (Tassiulas and 
Ephremides [28] described a network model with routing choice, whereas 
we have restricted ourselves to fixed routing for the sake of simplicity.) The 
scheduling constraint (2) is as before, the definition of idling (3) is as before 
and the queueing dynamics are now defined by 

Qn{T+l)=Qnir) + dAniT)-{dBniT)-dYniT)) + ^RmnidBmiT)-dYm{T)). 

m 

Equivalently, 

(6) Q(t) = Q(0) + A(r) -{I- i?"^)(B(r) - Y(r)). 

Note that A includes only exogenous arrivals to the network, not internally 
routed traffic. We will assume that routing is acyclic, that is, that work 
served from some queue n never returns to queue n. For example. Border 
Gateway Protocol (BGP) utilized for routing in the internet is designed 
to be acyclic [31]. This implies that the inverse R = {I — i?"*")"^ exists; by 
considering the expansion R = I + R^ + (i?"*")^ + • • • it is clear that Rmn G 
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{0, 1} for all m, n and that Rmn = 1 if work injected at queue n eventually 
passes through m, and Rmn = otherwise. When i? = we obtain a single- 
hop switched network. 

A straightforward bound we shall need is 

(7) Q„(r) < Q„(r') + A„(r) - A„(r') + J]fi,™(B„(r) - 5„(rO) 

m 

for r' < r. 

2.2. Scheduling policy. A policy that decides which schedule to choose 
at each time slot r G Z_|. is called a scheduling policy. In this paper we will 
be interested in the max-weight scheduling policy, introduced by Tassiulas 
and Ephremides [28]. We will refer to it as MW. 

2.2.1. Max-weight policy for single-hop network. We describe the policy 
first for a single-hop network. Let Q(t) be the vector of queue sizes at time r. 
Define the weight of a schedule tt S 5 to be tt • Q(t). The MW policy then 
chooses'^ for time slot r a schedule dB(r) with the greatest weight, 

(8) (iB(r) G argmaxTT • Q(r). 

7re<S 

This policy can be generalized to choose a schedule which maximizes tt • 
Q(t)", where the exponent is taken componentwise for some a > 0; call this 
the MW-a policy. More generally, one could choose a schedule such that 

(9) dB(r) e argmaxTT • /(Q(r)) 

7765 

for some function /:M+ — t- M+; call this the MW-/ policy. It is assumed 
that / satisfies the following scale-invariance property: 

Assumption 2.1. Assume / is differentiable and strictly increasing with 
/(O) = 0. Assume also that for any q G and tt G 5, with m(q) = 
maxpgsp- /(q), 

^ ■ /(q) = "^(^l) ■ /(/^q) = '"T'('^q) forallK;GM+. 

This is satisfied by f{x) = x", a > 0, but it is not satisfied, for example, 
for an input-queued switch with f{x) = log(l + x). 



^There may be several schedules which jointly have the greatest weight. To be concrete, 
we might specify some fixed numbering of schedules and choose the highest-numbered 
maximum-weight schedule. Alternatively, we might treat MW not as a policy per se but 
as a constraint on the set of allowed sample paths. For example, in a stochastic setting, 
we might allow ciB(r) to be a random variable, measurable with respect to the underlying 
probability space, satisfying (8) for every randomness. This permits "break ties at ran- 
dom." For the analyses in this paper, it makes no difference which of these two options is 
used. 
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2.2.2. Max-weight policy for multi-hop network. Now we define the multi- 
hop version of the MW-/ scheduhng pohcy. This poUcy chooses a sched- 
ule (iB(r) at time r such that 

dB{T) G argmaxTT • (/ - i?)/(Q(r)). 

Recall that matrix multiplication takes precedence over the • operator, so 
the argmax is of tt • {(/ — i?)/(Q(r))}; note also that 

[^/(Q)]n = Yl RnmfiQm) = f{[RQ]n), 
m 

where [i?Q]n is the queue size at the first queue downstream of n (or if 
there is no queue downstream). Thus 

(10) [(/ - R)fmn = f{Qn) - fimu). 

The difference f{Qn) — f{[RQ,]n) is interpreted as the pressure to send work 
from queue n to the queue downstream of n; if the downstream queue has 
more work in it than the upstream queue, then there is no pressure to send 
work downstream. For this reason, it is also known as backpressure policy. 

As before we will assume that / satisfies a scale-invariance property, the 
multi-hop equivalent of Assumption 2.1: 

Assumption 2.2. Assume / is differentiable and strictly increasing with 
/(O) = 0. Assume also that for any q G and tt G 5, with ?Ti(q) = 
maxpgsp- (/- i?)/(q), 

7r-(7-i?)/(q) = m(q) =^ tv ■ {I - R) f {nq) = m{Kq) forallKGlR+. 

We further require that the scheduler always have the option of not send- 
ing work downstream at any individual queue. Our Lyapunov function, and 
indeed our whole fluid analysis in Section 6, rely on this assumption. 

Assumption 2.3. For the multi-hop setting, assume that S satisfies the 
following: if tt G 5 is an allowed schedule, and p G is some other vector 
with pn G {0, TTn} for all n, then p£ S. 

In the rest of this paper, whenever we refer to a network running the 
MW-/ back-pressure policy, we mean that Assumptions 2.2 and 2.3 are 
satisfied. 

2.3. Stochastic model. Some of the results in this paper are about fluid- 
scaled processes, and others are about multiplicative state space collapse in 
the diffusion scaling, and the different results make different assumptions 
about the arrival process. 

Assumption 2.4 (Assumptions for the fluid scale). Let A(-) be a ran- 
dom process with stationary increments. Assume it has a well-defined mean 
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arrival rate vector A, that is, assume limT—^oo^niT)/T exists almost surely 
and is deterministic for every queue 1 < n < A^, and define 

(11) A„= lim -A„(r). 

Assume there is a sequence of deviation terms 6r G r G N, such that 
5r — as r — )• oo and 

P( sup-|A(r) - Arl > 5r ] asr-;>oo. 
Vr<r r J 

Assumption 2.5 (Assumptions for multiplicative state space collapse). 
Let A''(-) be a sequence of random processes indexed by r G N. For each r, 
assume that A^ has stationary increments, and a well-defined mean arrival 
rate vector A^ , and that there is some limiting arrival rate vector A such that 

A^ — 7- A as r — 7- oo. 

Assume there is a sequence of deviation terms 5z G z G N, such that 
— )• as z — oo and 

2:(log2;)^pfsup-|A^(r) — A''t| > 6zj — )• as z— )• oo, uniformly in r. 

\t<z Z / 

If the arrival process is the same for all r, say A*^' = A where A has 
a well-defined mean arrival rate vector, then Assumption 2.5 reduces to 

(12) f^Lup-\AiT)-XT\>5r)=o(—^ 

\r<rr J \r{\ogr)\ 

and it implies Assumption 2.4. For any arrival process with i.i.d. increments 
that are uniformly bounded, that is, such that there is an A^^^ for which 

An{T + 1) - A„(r) G [0, A'"^"] for ah n, r. 



equation (12) holds with 6r = C ^/\ogr / ^/r , with choice of an appropriate 
constant C that depends on A^^^^ by an application of concentration in- 
equality by Azuma [2] and Hoeffding [14]. More generally, it holds when 
the increments are not uniformly bounded but instead satisfy a reasonable 
moment bound. For example, an application of Doob's maximal inequal- 
ity [10] with bounded fourth moment and 6r = r~^/^ yields a stronger result 
than (12); this can be used to show that a Poisson process satisfies that 
equation. Furthermore (12) holds for a much larger class of stationary ar- 
rival processes beyond processes with i.i.d. increments, for example, Markov 
modulated processes (see Dembo and Zeitouni [9]). 

2.4. Motivating example. An internet router has several input ports and 
output ports. A data transmission cable is attached to each of these ports. 
Packets arrive at the input ports. The function of the router is to work 
out which output port each packet should go to, and to transfer packets to 
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Input 
ports 



input 1 



iput 2 



input 3 



output 1 output 2 output 3 

Output ports 

Fig. 1. An input-queued switch, and two example matching of inputs to outputs. 



the correct output ports. This last function is cahed switching. There are 
a number of possible switch architectures; we will consider the commercially 
popular input-queued switch architecture. 

Figure 1 illustrates an input-queued switch with three input ports and 
three output ports. Packets arriving at input i destined for output j are 
stored at input port i, in queue Qij] thus there are N = 9 queues in to- 
tal. (For this example it is more natural to use double indexing, e.g., Q3,2) 
whereas for general switched networks it is more natural to use single in- 
dexing, e.g., Qn for 1 < n < N.) 

The switch operates in discrete time. In each time slot, the switch fabric 
can transmit a number of packets from input ports to output ports, subject 
to the two constraints that each input can transmit at most one packet and 
that each output can receive at most one packet. In other words, at each time 
slot the switch can choose a matching from inputs to outputs. The schedule 
TT G M.^^ is given by TTjj = 1 if input port i is matched to output port j in 
a given time slot, and vTjj- = otherwise. Clearly tt is a permutation matrix, 
and the set S of allowed schedules is the set of 3 x 3 permutation matrices. 

Figure 1 shows two possible matchings. In the left-hand figure, the match- 
ing allows a packet to be transmitted from input port 3 to output port 2, 
but since (53,2 is empty, no packet is actually transmitted. 

3. Related work. Keslassy and McKeown [20] found from extensive sim- 
ulations of an input-queued switch that the average queueing delay is dif- 
ferent under MW-q policies for different values of a > 0. They conjecture: 

Conjecture 3.1. For an input-queued switch running the MW-a pol- 
icy, the average queueing delay decreases as a decreases. 

Though our work is motivated by the desire to establish Conjecture 3.1, we 
have not been able to prove it. But whereas the two main analytic approaches 
that have been employed in the literature yield results for the input-queued 
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switch that are insensitive to a, our result about multiphcative state space 
collapse is sensitive, as shown in Section 8. We speculate that our result 
might eventually form part of a proof of the conjecture. 

The two main analytic approaches that have been employed in the liter- 
ature are stability analysis and heavy traffic analysis. In stability analysis, 
one calculates the set of arrival rates for which a policy is stable (in the 
sense of [1, 8, 20, 24, 25, 28]). All the prior work in this context leads to the 
conclusion that MW-a has the optimal stability region, regardless of a. 

In heavy traffic analysis, one looks at queue size behavior under a dif- 
fusion (or heavy traffic) scaling. This regime was first described by King- 
man [21]; since then a substantial body of theory has developed, and modern 
treatments can be found in [3, 11, 29, 30]. Stolyar has studied MW-a for 
a generalized switch model in the diffusion scaling, and obtained a complete 
characterization of the diffusion approximation for the queue size process, 
under a condition known as complete resource pooling." This condition ef- 
fectively requires that a clever scheduling policy be able to balance work 
between all the heavily loaded queues. Stolyar [27] showed in a remarkable 
paper that the limiting queue size lives in a one-dimensional state space. 
Operationally, this means that all one needs to keep track of is the one- 
dimensional total amount of work in the system (called the workload), and 
at any point in time one can assume that the individual queues have all been 
balanced. Dai and Lin [6, 7] have established that similar result holds in the 
more general setting of a stochastic processing network. 

Under the complete resource pooling condition, the results in [6, 7, 27] 
imply that the performance of MW-a in an input-queued switch is always 
optimal (in the diffusion scaling) regardless of the value of a > 0. Therefore 
these results do not help in addressing Conjecture 3.1. This is our motivation 
for studying switched networks in the absence of complete resource pooling. 
Technically, the lifting map for a critically-loaded input-queued switch is 
degenerate and insensitive to a under complete resource pooling, but it is 
sensitive to a otherwise. 

We prove multiplicative state space collapse, following the method of 
Bramson [3]. The complement of Bramson's work is by Williams [30], and 
consists of proving a diffusion approximation, using an appropriate invari- 
ance principle along with the multiplicative state space collapse. We do not 
carry out this complementary aspect. Stolyar [27] and Dai and Lin [6, 7] 
have proved the diffusion approximation under the complete resource pool- 
ing condition, and Kang and Williams [17] have made progress toward it 
in the case without complete resource pooling, for an input-queued switch 
under the MW-1 policy. 

Whereas in heavy traffic models of other systems [3, 11, 27, 30] the lift- 
ing map from workloads to queue sizes is linear, we find instead that it is 
nonlinear — in fact it can be expressed as the solution to an optimization 
problem. The objective function of the problem is a natural generalization 
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of the Lyapunov function introduced by Tassiulas and Ephremides [28] for 
proving stability of the MW-1 pohcy; the constraints of the problem are 
closely linked to the canonical representation of workload identified by Har- 
rison [12]. The objective function for MW-a depends on a, and this hints 
that the performance measures might also depend on a. 

Finally, we take note of two related results. First, in [26] we have reported 
some results about a critically loaded input-queued switch without a com- 
plete resource pooling condition. Second, a sequence of works by Kelly and 
Williams [19] and Kang et al. [16] has resulted in a diffusion approximation 
for a bandwidth sharing network model operating under proportionally fair 
rate allocation, assuming a technical "local traffic" condition, but without 
assuming complete resource pooling. They show that the resulting diffusion 
approximation model has a product form stationary distribution. 

4. The fluid approximation. This section introduces the fluid model and 
establishes it as an approximation to a fluid-scaled descriptor of the switched 
network. Intuitively, the fluid model describes the dynamics of the system 
at the "rate" level rather than at finer granularity. The reader is referred 
to a recent monography by Bramson [4] and lecture notes by Dai [5] for 
a detailed account of fluid approximation for multiclass queueing networks. 
In Section 4.1 we specify the fluid model, in Section 4.2 we state the main 
result and in Section 4.3 we prove it. 

4.1. Definition of fluid model. Let time be measured by i G [0, T] for 
some fixed T > 0. Let q, a and y all be continuous functions mapping [0,T] 
into M^, and let s = (s7r)7re5 be a collection of continuous functions map- 
ping [0,T] into M+. Let x(-) = (q(-),a(-),y(-), «(•))■ This lies in C^{T) where 
/ = 3N + \S\. The definition below requires these functions to be absolutely 
continuous; such functions are differentiable almost everywhere, and the time 
instants where they are differentiable are called "regular times." Any equa- 
tions we write involving derivatives are taken to apply only at regular times. 

Definition 4.1 (Fluid model solution for single-hop network). Let /: 
]R-l_ — )• M+ satisfy Assumption 2.1. Say that x{-) is a fluid model solution for 
a single-hop switched network with arrival rate A E operating under the 
MW-/ policy if it satisfies equations (13)~(20) below. Write FMS for the set 
of all such X S C\T). Additionally, define 



FMSk = {x G FMS : |q(0)| < K] 
FMS(qo) = {x G FMS : q(0) = qo}. 



The equations are: 



(13) 
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(14) a(i) = At; 

(15) ^s„{t)=t; 

(16) y{t)<Y.s„{t)7v; 

(17) each s.„{-) and yn{') is increasing (not necessarily strictly increasing); 

(18) all the components of x{-) are absolutely continuous; 

(19) for regular times t, all n Vnit) =0 if quit) > 0; 
for regular times t, all tt G 5 

(20) 

s„{t) =0 if TT • /(q(t)) < maxp • /(q(t)). 

pes 

Here, q(t) represents the vector of queue sizes at time t, a{t) represents the 
cumulative arrivals up to time t, y{t) represents the cumulative idleness up 
to time t and s„{t) represents the total amount of time spent on schedule tt 
up to time t. The equation in (13) is the continuous analog of (4) combined 
with (5), and the inequality is the analog of the single- hop version of (7). 
Equation (14) represents an assumption about the arrival process, related 
to (11). Equation (15) says that the scheduling policy must choose some 
schedule at every timestep. Both (16) and (19) derive from the definition of 
idling, (3). Equation (20) is the continuous analog of (9). 

Definition 4.2 (Fluid model solution for multi-hop network). Let /: 
R-l_ — ?• M+ satisfy Assumption 2.2, and let S satisfy Assumption 2.3. Say 
that x(-) is a fluid model solution for a multi-hop switched network operat- 
ing under the MW-/ policy if it satisfies equations (14)-(19), and addition- 
ally (21) and (22) below. Let FMSm be the set of all such x € C^{T). Also, 
let FMSmi^ and FMSm(qo) be defined analogously to the single-hop case. 
The extra equations are 

(21) q(t) = q(0) + a(t) - (/ - R") ^-(*)^ " y(i)) 

and 

for all regular times t, all tt G 5 

(22) 

s^{t) = if TT • (I - i?)/(q(t)) < maxp • (/ - i?)/(q(t)). 

pes 

When we refer to "fluid model solutions for any scheduling policy," we 
mean processes x(-) E C^(T) satisfying (13) to (19) in the single-hop case, 
or satisfying (14) to (19) plus (21) in the multi-hop case. 
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4.2. Main fluid model result. The development in this section follows the 
general pattern of Bramson [3] . There is, however, a difference in presenta- 
tion that is worth noting. The main result of this section, Theorem 4.3, 
is a general purpose sample path-wise result: it does not make any proba- 
bilistic claim nor does it depend on any probabilistic assumptions. It can be 
applied to a switched network with stochastic arrivals in two ways: to obtain 
a result about fluid approximations (Corollary 4.4), and to obtain a result 
about multiplicative state space collapse (Section 7). 

We start by defining the fluid scaling. Consider a switched network of 
the type described in Section 2.1 running a scheduling policy of the type 
described in Section 2.2. Write X(r) = (Q(r), A(r), Y(r), ^(r)), r G Z+, to 
denote its sample path. Given a scaling parameter z >1, define the fluid- 
scaled sample path x{t) = (q(t), a(i), y(t), s(i)) for t G IR+ by 



after extending the domain of X(-) to M+ by linear interpolation in each 
interval (r, r + 1). In this section we are interested in the evolution of x{t) 
over t G [0,T] for some fixed T > 0, therefore we take x to lie in C^{T) with 
/ = 3A^ + |5|. 

The following theorem concerns uniform convergence of a set of fluid- 
scaled sample paths. Every fluid-scaled sample path is assumed to relate 
to some (unsealed) switched network, and all the switched networks are as- 
sumed to have the same network data, that is, the same number of queues N , 
the same set of allowed schedules 5, the same routing matrix and the 
same scheduling policy. 

The convergence is indexed by a parameter j lying in some totally ordered 
countable set. For Corollary 4.4 we will use j G N, and for Section 7 we will 
use a subset of N x N as the index set. We are purposefully using the symbol j 
here as an index, rather than the r used elsewhere, to remind the reader that 
the index set is interpreted differently in different results. 

Theorem 4.3. Let X he the set of all possible sample paths for single- 
hop switched networks with the network data specified above, running the 
MW-f scheduling policy, where f satisfies Assumption 2.1. Fix X > and 
A G M.^. Let there be sequences ej G M+ and G M^, indexed by j in some 
totally ordered countable set, such that 

(23) Ej — )• and — )• A, as j — )• oo. 

Consider a sequence of subsets Gj C C^{T) x [l,oo) which satisfy the fol- 
lowing: for every (x, z) G Gj there is some unsealed sample path X £ X such 
that X is the fluid-scaled version of X with scaling parameter z (here z is 



q(t) = Qizt)/z, 
y(t)=Y(zt)/z, 



aL{t) = A{zt)/z, 
Snit) = ST,{zt)/z 
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permitted to be a function of X ); and furthermore 

(24) inf{z : (5, z) G Gj} — )• oo as j ^ oo, 

(25) sup sup |a(t) — X^t\ < ej for all j 

{x,z)£Gjt&[0,T] 

and 

(26) sup |q(0)|<ii: for all j. 
Then 

(27) sup d{x,FMSK)^0 

{x,z)€Gj 

Furthermore, fix qo E and a sequence e'j 
also satisfy 

(28) sup |q(0)-qo| <e;- 

{x,z)GGj 

Then 

(29) sup (i(x,FMS(qo)) ^0 asj^oo. 

(x,z)€Gj 

Equivalent results to (27) and (29) apply to multi-hop switched networks, 
with references to FMS replaced by FMSm and the set X modified to refer 
to multi-hop networks running the MW-f scheduling policy where f satisfies 
Assumption 2.2 and S satisfies Assumption 2.3. 

The above theorem as stated appHes to the MW-/ scheduhng pohcy, but 
it is clear from the proof that a corresponding hmit result holds, relating 
sample paths of any scheduling policy to fluid models defined by equa- 
tions (13)-(19). 

The following corollary is a straightforward application of Theorem 4.3. 
It specializes the theorem to the case of a single random system X, and 
the sequence of fluid-scaled versions indexed by r € N where the rth version 
uses scaling parameter r. The arrival process is assumed to satisfy certain 
stochastic assumptions. This corollary is useful when studying the behavior 
of a single switched network with random arrivals, over long timescales. 

Corollary 4.4. Consider a single-hop switched network as described 
in Section 2.1, running the MW-f policy as described in Section 2.2 where f 
satisfies Assumption 2.1. Let the arrival process A(-) satisfy Assumption 2.4, 
and let the initial queue size Q(0) € be random. For r G N, let 

q-(t) = Q(rt)/r, a'^(t) = A(rt)/r, 

r{t)=Y{rt)/r, ~sl{t) = S^{rt)/r, 



as j ^ oo. 
0, and assume that the sets Gj 
for all j . 
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and let x''{t) = (q'"(t), a'^(t), y^(t), for t G [0,r] where T > is some 

fixed time horizon. Then for any 6 > 

P((i(f''(-),FMS(0)) <(5)^1 asr^oo. 

The same conclusion holds for a multi-hop switched network running the 
MW-f back-pressure policy where f satisfies Assumption 2.2 and S satisfies 
Assumption 2.3, with FMS replaced by FMSm. 

Proof. First define the event Er by 

£^^ = |sup-|A(T)-Ar|<5^ and |Q(0)|<Vfl, 

where A and 5r are as in Assumption 2.4. By this we mean that E^. is a subset 
of the probabihty sample space, and we write X{-){oj) etc. for u & Er to 
emphasize the dependence on Er . 

We will apply Theorem 4.3 with index set j = r G N to the sequence of 
sets 

Gj = Gr = {ix''{-){uj),r):oj£Er}. 

In order to apply the theorem we will pick constants as follows. Let K = 1, 
let A be as in Assumption 2.4, A-' = A for all j, Sj = = T6r where 5r is 
as in Assumption 2.4, qo = and e'^ = l/v^- We now need to verify the 
conditions of Theorem 4.3. Equation (23) holds by the choice of A-' and 
by Assumption 2.4. Equation (24) holds automatically by choice of Gj. To 
see that (25) holds, rewrite event E,. in terms of the fluid scaled arrival 
process a*" to see 

sup |a''(t)(a;) — At| < T6r for all r and u £ Er, 

t€[0,T] 

which implies (25); likewise for (26) and (28). We conclude that (29) holds. 
It may be rewritten in terms of Er as 

(30) sup d(x''(-)(w),FMS(0)) ^0 asr^oo. 

We next argue that P(-E'r) — 1 as r — oo . The event Er is the intersection 
of two events, one concerning arrivals and the other concerning initial queue 
size. The probability of the former — )• 1 as r — )• oo by Assumption 2.4. For 
the latter, P(|Q(0)| < ^/r) — 1 as r — oo since Q(0) is assumed not to be 
infinite. Therefore P(£'r) — ?• 1- Combining this with (30) gives the desired 
result for single- hop networks. The multi-hop version follows similarly. □ 

4.3. Proof of Theorem 4-3. We shall present the proof of Theorem 4.3 
for a single-hop network in detail followed by main ideas required to extend 
it to multi-hop networks. 
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4.3.1. Cluster points. Here we are interested in convergence in C^{T), 
where / = 3A^ + |5| and T > is fixed, equipped with the norm || • ||. The 
appropriate concept for proving convergence is cluster points. Consider any 
metric space E with metric d and a sequence {Ei,E2, . . .) of subsets of E. 
Say that x £ E is a cluster point of the sequence if limmij^ood{x, Ej) = 
where d{x, Ej) = mi{d{x, y)-y€ Ej}. 

Proposition 4.5 [Cluster points in C^{T)]'^. Given K >0, A>0 and 
a sequence Bj — )• 0, let 

Kj = {x£ C\T) : \x{<d)\ < K and mc5(x) < A5 + Bj for all 5 > 0}, 

and consider a sequence {Ei,E2,.. .) of subsets of C\T) for which Ej C Kj . 
Then sup^g^, d{y, CP) — )• as j ^ oo, where CP is the set of cluster points 
ofiEi,E2,..). 

4.3.2. Proof of Theorem 4-3. Let Ej = {x : {x, z) G Gj}. Lemma 4.6 below 
shows that Ej C Kj, with Kj as defined in Proposition 4.5 for appropriate 
constants K, A and Bj. By applying that proposition, 

sup d{x, CP) — as J — )■ oo, 

where CP is the set of cluster points of the Ej sequence. Lemma 4.7 below 
shows that all cluster points of the Ej sequence satisfy the fluid model equa- 
tions. Every cluster point x must also satisfy |q(0)| < K, by (26). Therefore 

sup d{x, FMSk) —^0 as j — )■ oo. 

xeEj 

If in addition (28) holds, then every cluster point x must also satisfy q(0) = 
qo. Therefore 

sup d{x, FMS(qo)) — )■ as j ^ oo. 

Lemma 4.6 (Tightness of fluid scaling). Let K and Gj be as in Theo- 
rem 4-3. Then there exist a constant A> and a sequence Bj — t- such that 
for every {x,z) G Gj, |x(0)| < K and 

\x{u) - x{t)\ < A\u-t\ +Bj for allO<t,u<T. 

Proof. Consider {x,z) G Gj, where x = (q, a, y,s). As per the defi- 
nitions in Section 2.1, the only nonzero component of x(0) is q(0) and 
|q(0)| < K by choice of Gj, hence |5;(0)| < K. For the second inequality, 
without loss of generality pick any < t < u < T, and let us now look at 
each component of \x{u) — x{t)\ in turn. 



*Taken from Bramson [3], Proposition 4.1. 
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For arrivals, let A™'^^ = sup^ | A-' | ; this is finite by the assumption that 
A-' —7- A in Theorem 4.3. Then for (i, z) G Gj, 

|a(n) - a(t)| < |a(u) - X^u\ + \X^{u - t)\ + |a(t) - XH\ 

<2ej + \X^\{u-t) by (25) 

<2ej+A"^^^(n-t). 

For idling, let = max,rg5 max„ vr^. This is the maximum amount of 

service that can be offered to any queue per unit time, and it must be finite 
since |5| is finite. Then, based on (3), 

\yniu) - ynit)\ <iu- + 

<{u- + 25"'^^/z™°, 

where z™™ = inf{z : (x, z) S Gj}. For service, let 5'7r(-) be the unsealed pro- 
cess that corresponds to s.„{-); since S.„{-) is increasing and since a schedule 
must be chosen not more than once every time slot, 

\s^{u) - sUt)\ < -{S^{\zu])- S^i[zt\)) <{u-t) + 2/z <{u- t) + 2/zf ^ 
z ■' 

For queue size, note that (4) carries through to the fluid model scaling, that 

is, 

q(t) = q(0) + a(t) - s^{t)7v + y(t), 

TT 

thus 

\qn{u) - qn{t)\ 

< \dn{u) - an{t)\ +^vr„|s^(n) - s^(i)| + \yn{u) - 

TT 

< (u - t)(A"^^" + l^l^"^^'^ + 5"^^") + (2|5|5'"^'^ + 2S"^^")/zf + 2ej. 
Putting all these together, 

(31) \S:{u)-x{t)\<A{u-t) + Bj, 

where the constants are 

A={N + + 2iV5'^^^ + \S\ + iV|5|5"'''^ 

and 

Bj = (47VS'"^^ + 2|5| + 2iV|5|5'^^^)^ + (2 + 2N)ej. 

^min 

By the assumptions of Theorem 4.3, £j — )■ and z™"^ — )• oo as j — >• oo, thus 
Bj — >■ as required. □ 

Lemma 4.7 (Dynamics at cluster points). Make the same assumptions 
as Theorem 4-3, and let Ej = {x : (x, z) G Gj}. Then x G FMSa' if x is a clus- 
ter point of the Ej sequence. 
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Proof. From Lemma 4.6 and Proposition 4.5, it follows that 
limsup^g^;^, d{x, CP) — as j — oo where CP is the set of cluster points 
of the sequence Ej. Let x be one such cluster point. That is, there exists 
a subsequence jk and a collection G Eji_ such that — t- x. It easily 
follows that \x{0)\ < K since |2;-^'''(0)| < K for all x^^ G Ej^ as argued in 
Lemma 4.6. Using this, we wish to establish that x satisfies all the fluid 
model equations to conclude x G FMS^^. For convenience, we shall omit the 
subscript k in the rest of the proof; that is, we shall use j in place of jk and 

Proof of (13), (15), (17). The discrete (unsealed) system satisfies these 
properties; therefore the scaled systems x^ do too. Taking the limit yields 
the fluid equations. 

Proof of (16). In (3), (iB(r) and Q(t) are both nonnegative (component- 
wise), hence (iY(r) < dB(r) for all r. Summing up over r, we see the discrete 
(unsealed) system satisfies the equivalent of (16), so as above we obtain the 
fluid equation. 

Proof of (14). Observe that 

sup |a(t)-At|< sup \si(t)-Si\t)\+ sup \si\t) - XH\+T\X^ - X\. 

tG[0,T] te[0,T] t6[0,T] 

Each term converges to as j — )• oo: the first because x^ ^ x, the second 
because x^ G Ej so the deviation in a? is bounded by £j and £j — )• and 
the third because A-' — )■ A. Since the left-hand side does not depend on j, it 
must be that a(t) = At. 

Proof of (18). In Lemma 4.6 we found constants A and Bj such that for 
all X G Ej 

\x{u) - x{t)\ < A\u -t\+ Bj 

with Bj ^ as j — t- oo. Taking the limit of \x^ (u) — x^{t)\ as j — t- oo, we 
find that \x{u) — x{t)\ < A\u — 1\ ; that is, x is (globally) Lipschitz continuous 
(of order 1 with respect to the appropriate metric as defined earlier). This 
immediately implies that x is absolutely continuous. 

Proof of (19). Since x= (q, a, y,s) is absolutely continuous, each com- 
ponent is too, which means that y„ is differentiable for almost all t. Pick 
some such t, and suppose that qn{t) > 0. Consider some small interval 
/= [t,t-|-(^] about t. Since g„ is continuous, we can choose 5 sufficiently 
small that mis^i Qnis) > 0. Since Hq-' — q|| — )• 0, we can find c > such that 
infs£i qn{s) > c for all j sufficiently large. Since x^ G Ej, there exists a cor- 
responding unsealed version of the system, say , and scaling parameter, 
say Zj, so that x^{-) = (zj-)/ Zj . Therefore, it must be that the correspond- 
ing unsealed queue satisfies inig^j Qn{zjs) > ZjC. That is, the queue size in 
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the entire interval never vanishes to and hence idhng in the entire interval 
is not possible. Therefore after rescaling we find yii(t + 5/2) = yii(t). (The 
switch from 6 to 5/2 sidesteps any discretization problems.) Therefore the 
same holds for y„ in the limit. We assumed y„ to be differentiable at t; the 
derivative must be 0. 

Proof of (20). Pick a t at which s^r is differentiable, and suppose that 
7r-/(q(t)) < maxpp-/(q(t)). As above, pick some small interval /= + 
and j sufficiently large that 

TT • /(q-'(s)) < maxp • /(q''(s)) for s € /. 
pes 

Writing this in terms of the unsealed system and applying Assumption 2.1, 

TT • f{Q^Zjs)) < maxp • f{Q,^{zjs)) for s G /. 
pes 

The MW-/ policy ensures by (9) that tt will not be chosen throughout this 
entire interval, so after rescaling we find si{t + 5/2) - si{t) = 0, and taking 
the limit gives s,r(i + 5/2) = s,r(i)- Since s,r is assumed to be differentiable 
at t; the derivative must be 0. □ 

4.3.3. Proof of Theorem 4-3 for multi-hop networks. The proof of Theo- 
rem 4.3 for single-hop network applies verbatim, except that the two lemmas 
need to be replaced. 

Lemma 4.6 (Tightness of fluid scaling) — > Lemma 4.8. 

Lemma 4.7 (Dynamics at cluster points) — > Lemma 4.9. 

Lemma 4.8 (Tightness of fluid scaling). Make the same assumptions as 
Theorem 4-3, multi-hop case, and use the same definition ofGj. Then there 
exist a constant ^ > and a sequence Bj — )• such that for every (x, z) G Gj, 
|i(0)| < K and 

\x{u)-x{t)\ < A\u-t\+Bj for allO<t,u<T. 

Proof. Consider {x,z) G Gj, x = (q,a,y,s). The bound \x{0)\ < K fol- 
lows from an argument similar to that in the single-hop case. The bounds on 
the arrival process, the idleness and service allocation are as in the single-hop 
case: for any <t < u <T, 

\a{u)-aL{t)\ < {u 

\yn{u) - yn{t)\ < (u 

\Snit) - S7r(s)| < {u 



t)A"^^^ + 2e 



J' 

min 



t) + 2/zf°. 
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where z™™ = mf{z : {x, z) G Gj}. The bound on queue size is a httle different. 
Note that (6) carries through to the fluid-scahng, that is, 

q(t) = q(0) + a(t) -{I- R") ^ 5„(t)7r + y(t), 

thus 

\qn{u) - qn{t)\ < \an{u) - an{t)\ + - R^)T^]n\\s-K{u) - S,r(t)| 

77 

+ \yn{u) -yn{t)\ 
<{u- t)(A""^^ + |5|(Ar5°^^^)5""^^ + 5°^^^) 

Putting ah these together, for any {x^z) G Gj, 

\x{u) - x{t)\ < A{t -s) + Bj, 
where the constants A and Bj are 

(32) I I I IV ; 

Bj = (4iV5™^^ + 2|5| +2|5|(iV5'"^^)2)^^ + {2N + 2)ej. 

Here Bj — )• as j — )■ cxd since Ej — )• by (23) and — )• oo as j — )■ cxd by (24). 
□ 

Lemma 4.9 (Dynamics at cluster points). Under the setup of Theo- 
rem 4-3 for a multihop network, let Ej = {x : (x, z) G Gj}. Then x G FMSmi^ 
if X is a cluster point of the Ej sequence. 

Proof. Given a cluster point x = (q, a, y, s), let there be [x^ ,Zj) G Gj 
so that x^ — )• X, as in the proof of Lemma 4.7. Now the bound |2;(0)| < K and 
equations (14)-(19) all work exactly as in the single-hop case, as does the 
queue size equation (21). The only equation that needs further argument is 
the MW-/ backpressure equation (22). 

Proof of (22). Pick a t at which s^r is differentiable, and suppose that 
TV- {I- R)f{q{t)) < maxp p • (/ - i2)/(q(t)). As in Lemma 4.7, proof of (20), 
it must be that there is some small interval / = [zjt, Zjt + Zj5] such that tt 
is not chosen for any t & I, therefore S7r(t) =0. □ 

5. Fluid model behavior (single-hop case). In this section we prove cer- 
tain properties of fluid model solutions, which will be needed for the main 
result of this paper, multiplicative state space collapse. In order to state 
these properties, we first need some definitions. We then state a portman- 
teau theorem listing all the properties, and give an example to illustrate the 
definitions. The rest of the section is given over to proofs and supplementary 
lemmas. 
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This section deals with a single-hop switched network; in the next section 
we give corresponding results for multi-hop. Our reason for giving sepa- 
rate single-hop and multi-hop proofs, rather than just treating single-hop 
as a special case of multi-hop, is that our multi-hop results place additional 
restrictions on the set of allowed schedules (Assumption 2.3) beyond what 
is required for single-hop networks. This mainly affects the proof; the port- 
manteau theorem for multi-hop is nearly identical to that for single- hop. 

Definition 5.1 (Admissible region). Let S C be the set of allowed 
schedules. Let (S) be the convex hull of 5, 

~ I ^■"''^ • '^■^ ~ ^' Q^TT > for all TT > . 

Define the admissible region A to be 

A = {A G : A < cr componentwise, for some a G (5)}. 

Definition 5.2 (Static planning problems and virtual resources). De- 
fine the optimization problem PRIMAL(A) for A G to be 

minimize a.„ over G M+ for all tt G 5 
7re5 

such that A < a^^n componentwise. 

Let DUAL(A) be the dual to this: it is 

maximize f • A over £ G such that maxf • tt < 1. 

7re<S 

Let E be the set of extreme points of the feasible region of the dual problem; 
the feasible region is a finite convex polytope so E is finite. Define the set 
of virtual resources S* C to be the set of maximal extreme points, 

S* = £ E -.for all C e E,^ < C ^ ^ = C}- 
Define the set of critically loaded virtual resources H(A) to be 

H(A) = {£GcS*:£-A = 1}. 

Both problems are clearly feasible, and the optimum is attained in each. 
By Slater's condition there is strong duality, that is, PRIMAL(A) = DUAL(A). 
[When we write PRIMAL(A) or DUAL(A) in mathematical expressions, we 
mean the optimum value, not the optimizer.] Clearly, PRIMAL(A) < 1 if 
and only if A is feasible. 

Laws [22, 23] and Kelly and Laws [18] used primal and dual problems 
of this general sort for describing multi-hop queueing networks with rout- 
ing choice. Harrison [12] extended the problems for stochastic processing 
networks. 
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Definition 5.3 (Lyapunov function and lifting map). Let the schedul- 
ing policy be MW-/, where / satisfies Assumption 2.1. Define the function 
L : ^ M+ by 

L(q)=F(q).l, 

where F{x) = Jq f{y)dy for x G M, and F{q) = [F{qn)]i<n<N as per the 
notation in Section 1. Define the optimization problem ALGD(q) to be 

minimize L{r) over r G 

such that ^ • r > ^ • q for all ^ € "(■^) and 
fn ^ Qn for all n such that A„ = 0. 

Note that F is strictly convex and increasing, and the feasible region is 
convex; hence this problem has a unique optimizer. Define the lifting map 
AW-.M^ by setting AW{q) to be the optimizer. 

Note that ALGD and AW both depend on A and /, but we will surpress 
this dependency when the context makes it clear which A and / are meant. 

The results in this section apply to any A G A. However, if PRIMAL(A) < 1, 
then H(A) is empty, so AVF(q) = for all q. The results are only interesting 
when PRIMAL(A) = 1, so we define 

9A = {A G A : PRIMAL(A) = 1}. 

We can now state the main result of this section. 

Theorem 5.4 (Portmanteau theorem, single-hop version). Let A G A. 
Consider a single-hop switched network running MW-f , where f satisfies 
Assumption 2.1. 

(i) For any K < oo, {q G : L(q) < K} is compact. Also, for any fluid 
model solution with arrival rate A, L(q(t)) < L(q(0)) for all t>0. 

(ii) AW is continuous. 

(iii) // q = AW{q), then AW{Hiq) = KAW{q} for all k>0. 

(iv) Say that is an invariant state if all fluid model solutions q(-) with 
arrival rate A, starting at q(0) = q'^, satisfy q(t) = q'^ for all t>0. Then 

is an invariant state •<=^ q*^ = AW{c^). 

(v) For any e > there exists some Hf, < oo such that, i/q(-) is a fluid 
model solution with arrival rate A, and |q(0)| < 1, then |q(t) — AM^(q(t))| < e 
for all t>Hs. 

A loose interpretation of these results is that the MW-/ scheduling policy 
seeks always to reduce L{q) [part (i)], but it is constrained from reducing it 
too much, because it is not permitted to reduce the workload at any of the 
critically loaded virtual resource (the constraints of ALGD). However, it can 
choose how to allocate work between queues, subject to those constraints. 
It heads towards a state where it is impossible to reduce L{q) any further 
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[parts (iv) and (v)]. In all the examples we have looked at, the fluid model 
solutions reach an invariant state in finite time, that is, (v) holds also for 
e = 0, but we have not been able to prove this in general. 

5.1. Example to illustrate A, dA, S* and H. Consider a system with 
N = 2 queues, A and B. Suppose the set S of possible schedules consists 
of "serve three packets from queue A" and "serve one packet each from A 
and Write these two schedules as tt ^ = (3,0) and tt^ = (1,1), respec- 
tively. Let Ayi and As be the arrival rates at the two queues, measured in 
packets per time slot. 

Determining A and dA. The arrival rate vector A = (A^, A^) is feasible if 
there is some <t = (1 — x)7r^ + xtt^ with < x < 1 such that A < <t. In words, 
the arrival rates are feasible if the switch can divide its time between the two 
possible schedules in such a way that the service rates at the two queues are 
at least as big as the arrival rates. Schedule tt^ is the only schedule which 
serves queue B, so we would need x > Xb- IfAB>l, then it is impossible 
to serve all the work that arrives at queue B. Otherwise, we may as well set 
X = Xb- The total amount of service given to queue A is then 3(1 — x) + x = 
3 — 2Xb; if Aa < 3 — 2Ab, then it is possible to serve all the work arriving at 
queue A. We have concluded that 

A = {{Xa, Xb) : Ab < 1 and ^Xa + fAs < 1}. 

Further algebra tells us that 

PRIMAL(A) = max(AB, ^A^ + |Aij). 

Hence 

9A = {(Aa,Ab)gA:Ab = 1 or iAA + |AB = l}. 

Determining S* and H. The feasible region of DUAL (A) is 

{(6,6) G 36 < 1 and 6 + 6 < !}• 
The extreme points may be found by sketching the feasible region; they are 
(0,0), (|,0), (5,!) and (0,1). Clearly the maximal extreme points, that is, 
the virtual resources, are 

5* = {(i,|),(0,l)}. 

The set of critically loaded virtual resources depends on Xa and A^ : (0, 1) G 
H(A) iff Xb = 1, and (i,|)GH(A) iff A^/S + 2Ab/3 = 1. 

Interpretation of virtual resources.^ Each virtual resource ^ G 5* may 
be interpreted as a virtual queue. For example, take ^ = (|) f), and define 
the virtual queue size to be ^ • Q = Qa/S + 2Qb/3. Think of the virtual 



'^cf. Laws [22], Example 4.4.3. 
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queue as consisting of tokens: every time a packet arrives to queue A put 3 
tokens into the virtual queue, and every time a packet arrives to queue B 
put in I tokens. The schedule vr^ can remove at most 3 x | = 1 token, and 
schedule tt^ can remove at most | + | = 1 token. In order that the total rate 
at which tokens arrive should be no more than the maximum rate at which 
we can remove tokens, we need 

Aa/3 + 2Ab/3<1, 

that is, A • ^ < 1. If DUAL(A) = PRIMAL(A) > 1, then there is some ^ e S* 
such that A • ^ > 1, which means that the corresponding virtual queue is 
unstable; hence the original system is unstable. 



5.2. Proofs for the portmanteau theorem. Throughout this subsection we 
consider a single-hop switched network running MW-/ with arrival rates 
AG A. 

The first claim of Theorem 5.4(i), that {q G M.^ ^ is compact 

for any K <oo, follows straightforwardly from the facts that L{q) — )■ 00 as 
|q| 00, and L(-) is continuous. The second claim follows from a standard 
result (first given by Dai and Prabhakar [8], for an input-queued switch), 
which we include here for the sake of completeness. 

Lemma 5.5. ForallqeR^, 
(33) A-/(q)-max7r-/(q)<0. 

ttGo 

Also, every fluid model solution satisfies 

^L(q(t)) = A • f{q{t)) - maxTT • /(q(t)) < 0. 
at 7re<s 

Proof. Since A G A, we can write A < cr componentwise for some cr = 
J2-7r "ttTt with a,r > and ^ = 1. Hence 

A • /(q) - maxp • /(q) = Va^vr • /(q) - maxp • /(q) 
p ^ p 

For the claim about fluid model solutions, 
|L(q(t)) = q(t) • /(q(i)) 

A - ^ s^(t)7r + y(t) j • /(q(t)) by differentiating (13) 
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A - J] sAt)A ■ /(q(i)) by (19), using /(O) = 
= A • /(q(0) - maxp • /(q(t)) ^ by (20) 

TT 

= A-/(q(i))-maxp./(q(i)) by (15) 
p 

< by (33). □ 

To prove Theorem 5.4(ii), it is useful to work with a "fuher" representation 
of the hfting map. Let E be the set of extreme feasible solutions of DUAL(A), 
and define 

(34) H+(A) = {^Gi?:^.A=l}. 

This includes nonmaximal extreme points, whereas H(A) only includes max- 
imal extreme points. 

Lemma 5.6. The lifting map AVF(q) is the unique solution to the opti- 
mization problem ALGD^(q), 

minimize L{r) over r G 

such that ^ • r > ^ • q for all ^ G H'^(A). 

Proof. ALGD'*"(q) has a unique minimum for the same reason that 
ALGD(q) has a unique minimum. 

Next we claim that if r is feasible for ALGD(q) then it is feasible for 
ALGD'''(q). Pick any ^ G H"'"(A). By definition, ^ is an extreme feasible 
solution of DUAL (A) and ^ • A = 1. Since it is an extreme feasible solution, 
^ < ^ for some virtual resource ^ G 5*. Since ^ • A = 1 we know C " ^ !> but 
by assumption A G A; hence • A = 1 and furthermore < Cn only for n 
where = 0. Now, 

^•r-^.q=(C-r-C-q) + (^-C)-(r-q). 

We assumed that r is feasible for ALGD(q); by the first constraint of 
ALGD(q) the first term in the preceding equation is positive; by the second 
constraint the second term is positive. We have shown that ^ • r > ^ • q for 
all ^ G H+(A); hence r is feasible for ALGD+(q). 

Next we claim that if r is optimal for ALGD"^(q), then it is feasible 
for ALGD(q). Clearly it satisfies the first constraint of ALGD(q). Suppose 
it does not satisfy the second constraint, that is, that r„ > qn for some n 
where A„ = 0, and define r' hj r'^ = rm \i m^n and r'^ = Then r' < r, 
hence L{r') < L{r). Also, r' is feasible for ALGD'^(A). To see this, pick any 
C G S"''(A), and let ^ G H"'"(A) be such that (m = (,m '^^ rn n and = 0. 
Then 

C • r' = ^ • r' + Cnr'n = ^ • r + Cnr'n > ^ • q + Ur'n = C • q- 
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The inequality is because r is feasible for ALGD^(q). This contradicts op- 
timality of r. 

Putting these two claims together completes the proof. □ 

With this representation, the lifting map AW can be split into two parts. 
Let H+(A) = {^\...,^^} and define the workload map W-.R^^R^ 
W{q) = [C ■ (l]i<v<v- Also define A : ^ by 

(35) A{w) = argmin{L(r) : r G and • r > u?^ for 1 < w < V}. 

(This has a unique optimum for the same reason that ALGD^ and ALGD 
have.) Then the lifting map is simply the composition of A and W. It is 
clear that W is continuous; to prove Theorem 5.4(ii) we just need to prove 
that A is continuous. 

Lemma 5.7. A is continuous. 

Proof. If H^(A) is empty, then A is trivial and the result is trivial. In 
what follows, we shall assume that H"'"(A) is nonempty, and we will abbrevi- 
ate it to H"*". Furthermore note that for every ^ G H"*" there is some queue n 
such that > 0; this is because ^ • A = 1 by definition of H+. 

Pick any sequence ^ w ^ M^, and let r'^ = A(u;'^) and r = A(w). We 
want to prove that ^ r. We shall first prove that there is a compact 
set [0,/i]^ such that e [0,h]^ for all k. We shah then prove that any 
convergent subsequence of r*^ converges to r; this establishes continuity of A. 

First — compactness. A suitable value for h is 

h = max max sup — ^ . 

l<v<Vn:i„>0 k 

Note than the maximums are over a nonempty set, as noted at the beginning 
of the proof. Note also that h is finite because w is finite. Now, suppose that 
r'^ ^ [0, h]^ for some k, that is, that there is some queue n for which > /i, 
and let r' = r*^ in each component except for = h. We claim that r' satisfies 
the constraints of the optimization problem for A{w^). To see this, pick any 

G ; either = in which case • r' = • r'^ > , or .^^ > in which 
case • r' > ^J^/i > by construction of h. Applying this repeatedly, if 
r'^' ^ [0,/i]^, then we can reduce it to a queue size vector in [0, /i]^, thereby 
improving on L(r'^), yet still meeting the constraints of the optimization 
problem for A(u;'^); this contradicts the optimality of r'^. Hence G [0, /i]^. 

Next — convergence on subsequences. With a slight abuse of notation, let 
A{w'^) = r*^ — )• s be a convergent subsequence, and recall that A{w) = r and 

— )• w. By continuity of the constraints, s is feasible for the optimization 
problem for A{w)\ we shall next show that L{s) < L{r). Since r is the unique 
optimum, it must be that s = r. 
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It remains to show that L{s) < L{r). Consider the sequence r + e^l as 
candidate solutions to the problem A[w^) where 

e = max 



l<v<V ' 

This choice ensures that the candidates are feasible, since 

• (r + e^l) = • r + e^^'' • 1 > • r + - > u;^. 

(If we had used ^ G H rather than ^ G H"*", it would not necessarily be true 
that the candidates are feasible; this is why we introduced Lemma 5.6.) 
Since the candidates are feasible solutions to the problem A{w''), and r'^' is 
an optimal solution, it must be that 

L(r^) <L(r + e^'l). 

Taking the limit as A; — t- oo, and noting that L is continuous and e*^ — )• 0, we 
find 

L(s) < L(r) 
as required. This completes the proof. □ 

For the proof of Theorem 5.4(iii), it is useful to work with a different 
representation of the constraint of ALGD, provided by the following lemma. 

Lemma 5.8. 

(i) AW{q) = [q + t{X - <t)]+ for some t>0 and a e {S). 

(ii) [q + t{\ - cr)]+ is feasible for ALGD(q) for aHt>0 and cr e{S). 

Proof, (i) We will shortly prove that the following are equivalent, for 
all q and r G R^: 

(36) r>q + t(A-cr) for some t > 0, cr G (5); 

(37) ^•r>^-q for alUGH+(A). 

We use this equivalence as follows. From Lemma 5.6 we know that AM^(q) 
is the solution of ALGD+(q). That is, letting q' = AW{q), equation (37) 
holds with q' in the place of r. Hence (36) holds for some t>0 and cr G (S) ; 
moreover since q' > it must be that 

q'>[q + t(A-^)] + . 

We claim that this inequality is in fact an equality. To see this, note that 
r = [q + t(A — cr)]+ satisfies (36); hence it satisfies (37); hence it is a feasible 
solution of ALGD''~(q). Note also that L(-) is increasing componentwise, 
hence L(q') > -^^(r). But ALGD~''(q) has a unique minimum, hence q' = r 
as required. This completes the proof of Lemma 5.8(i), once we have proved 
the equivalence between (36) and (37). 
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Proof that (36) =^ (37). Pick any ^ G H+(A). By definition of H+(A), 
we know: ^ > 0; ^ • tt < 1 for all tt G 5, hence ^ • <t < 1 for all cr G {S); and 
^ • A = 1 . Hence 

^ • r > ^ • q + t{^ • A — ^ • cr) assuming q, r G satisfying (36) 
= C-q + i(l-^-cr)>^-q + t(l-l)=^-q. 

Proof that (36) (37). Let q and r satisfy (37), and let cr' = A — (r — 
q) /t for some sufficiently large t G M+ . We shortly show that the value of 
DUAL(cr') at its optimum is < 1. By strong duality the value of PRIMAL(cr') 
at its optimum is likewise < 1, and so by definition of PRIMAL(cr') we can 
find some cr G (5) such that cr' <cr componentwise. Then 

r = q + t(A - cr') > q + t(A - cr), 

that is, r satisfies (36). 

It remains to show that the value of DUAL (cr') at its optimum is < 1, 
that is, that C ' ^ 1 for all dual-feasible ^. We have assumed that A G A, 
hence C ' ^ 1- On one hand, if C ■ = Ij then it follows from the definition 
of H+ that C G i'^'^W), hence 

C-^' = C-A-C-(r-q)/t 

= 1 — C ■ (r — q)/t since ^ • A = 1 

< 1 by (37). 

On the other hand, if ^ • A < 1, then 

C • <t' < 1 - C • (r - q)/t, 

and this is < 1 for t sufficiently large. Either way, C ' ^' ^ 1- Therefore the 
value of DUAL (cr') at its optimum is < 1. 

(ii) For this, we need to check two feasibility conditions of ALGD(q). The 
first feasibility condition is 

I- [q + t(A- cr)]+ >^-q for aU ,t G H(A). 

Pick any ^ G H(A). By definition of H(A), ^ > 0, ^ • tt < 1 for all tt G 5 hence 
^ • cr < 1 for all cr G (S) , and ^ • A = 1, thus 

^•[q + t(A-<T)]+>^-[q + t(A-cr)]=^.q + t(l-^.<T)>^-q 

as required. The second feasibility condition is that if = for some n, 
then 

[qn + t{Xn - CTn)]^ = [Qn " tan]"^ < Qn- 

This is true because cr > componentwise for all cr G (S) . □ 

Theorem 5.4(iii) is a corollary of the following lemma. 

Lemma 5.9 ( Scale-invar iance of AW). Let qGM^. Then AW{k.c[) = 
K X AP^(q) for all k > 0. 
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Proof. We will first establish three preliminary properties of AW. Pre- 
liminary 1 is used to prove 2, and 2 and 3 are used in the main proof. 

Preliminary 1. If q = AW{q') for some q' G then 

(38) A-/(q)=max7r-/(q). 

7765 

To see this, suppose tv gS has maximal weight and consider r = [q + t(A — 
tt)]"'". This is feasible for ALGD(q) by Lemma 5.8. Now, using the fact that 
/(0)=0, 

|L([q + f(A-7r)] + )|,^0 = (A-7r)./(q). 

Since q is optimal for ALGD(q') it is optimal for ALGD(q), hence A - /(q) > 
^ ' /(q)- On the other hand, A € A so A < cr for some cr G (5), hence A • 
/(q) < ■ /(q) ^ • /(q)- Hence the result follows. 

Preliminary 2. Suppose that r = AW{q). From Lemma 5.8, r = [q + 
t(A — cr)]"'' for some t > and a G {S). Then either t = or 

(39) cr./(r)=max7r-/(r). 

7765 

This is because t is an optimal choice, so either t is constrained to be or 

|-L([q + n(A - a)] + )L=, = (A - ^) • /(r) = 0. 

In this second case, A • / (r) = max^r tt • /(r) by (38) so the same is true for cr. 

Preliminary 3. Suppose that r = A14^(q). From Lemma 5.8, we can write 
it as r = [q+ t(A — cr)]'^ for some cr G (5) . In fact, for any T >t we can write 
it as 

(40) r= [q + r(A-p)]+ for some pG (cS). 

To see this, recall that PRIMAL(A) < 1, so we can pick some A G {S) such 
that A < A, whence 

r > [q + i(A - cr) + (T - t)(A - A)] + 

= [q + r(A-p)]+ wherep=^(T + ^AG(5). 

This last expression is feasible for ALGD(q) by Lemma 5.8. Since r is 
optimal for ALGD(q), and the objective function is increasing pointwise, 
r = [q + T(A — p)]^ as claimed. 

Main proof. Let r = AVF(q) and kt' = AVF(Kq). We know that kt is 
feasible for ALGD(Kq) because the constraints are linear; we will now show 
that L{Kr) < L(«:r'); hence kt is also optimal for ALGD(Kq). By uniqueness 
of the optimum, kt = kt' as required. 
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It remains to prove that L[kt) < L{kt'). Since r solves ALGD(q) and kt' 
solves ALGD(Kq), we can use Lemma 5.8 to write 

r = [q + t(A - cr)] + , nr' = [/tq + Kt'(A - cr')] + 

for t, i'sM^ and er^er' ^ {S). Indeed, for T>max(t,t') we can use (40) to write 

r = q + r(A-p + y) for p G (5), y G M^" 

where = if r„ > 0, 

r' = q+T(A-p' + y') for p' G (5) , y' G M^' 

where = if r,^ > 0. 

Now consider the value of L(-) along the trajectory from kt to kt' . Along 
this trajectory, 

——L(kt + (r' — r)u/T) = (r' — r) • f{Kr)/T 

= (p-p'-y + y')-/M 

> (p - p' - y) • /(Kr) since y' > 

= (p - p') • /(Kr) since y„, = if r„ > 

> p • /(^r) — maxTT • /(ftr) for any p' G (S) 

= 0. 

The final equality is because p • /(r) = max„ tt • /(r) by (39), so p • /(^r) = 
maxTrTT- /{kt) by Assumption 2.1. Since L(-) is convex, it follows that 
L(Kr') > L{k,v). This completes the proof. □ 

The proof of Theorem 5.4(iv) relies on the following lemma. 

Lemma 5.10 (Fluid model trajectories preserve ALGD feasibility). Con- 
sider any fluid model solution, for any scheduling policy, with initial queue 
size q(0). Then q(t) is feasible for ALGD(q(0)) for allt>0. 

Proof. Pick any critically loaded virtual resource ^ G S(A). By (13), 
^ • q(t) = ^ • q(0) + 1(^ • A - ^ • cr(t)) + ^ • y(t) where cr(t) = ^ 7TS„{t)/t 
> ^ ■ q(0) + 1(^ • A - ^ • a{t)) since y{t) > 
>^-q(0)+t(l-l)=^-q(0). 

The last inequality is because $, G H(A) ; so ^ • A = 1, and ^ • tt < 1 for all tt G 5 
hence ^ • cr < 1 for ah cr G (5). Finally q(t) < q(0) +tA by (13) and (16), and 
this yields the second constraint of ALGD (A) for queues n with arrival 
rate. □ 
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Theorem 5.4(iv) is impHed by parts (i) and (ii) of the following lemma. 

Lemma 5.11 (Characterization of invariant states of MW-/). The fol- 
lowing are equivalent, for G M.^ : 

(i) qO = AW^(qO); 

(ii) q*^ is an invariant state; 

(iii) there exists a fluid model solution with q(t) = q*^ for all t; 

(iv) A-/(qO) = max„65 7r-/(qO). 

Proof. 

Proof that (i) =^ (ii). Suppose that q*^ = AVF(q°), that is, that q° is 
optimal for ALGD(q'^), and consider any fluid model solution which starts 
with q(0) = q*^. On one hand. Lemma 5.5 says that L(q(t)) < L(q'^). On 
the other hand. Lemma 5.10 says that q(t) is feasible for ALGD(q'^). Since 
ALGD(q'') has a unique solution, it must be that q(t) = q*^. 

Proof that (ii) =^ (iii). It is easy to find a fluid model solution which 
starts at q(0) = q*^: a limit point of the stochastic model from Theorem 4.3 
will do. By (ii), the queue size vector is constant. 

Proof that (iii) =^ (iv). Suppose there is a fluid model solution with 
q(t) = q°. Since q(-) is constant, L(q(t)) = 0. Lemma 5.5 says that L(q(t)) < 0, 
so the inequality in the proof must be tight for all t, that is, 

(41) A./(q0)=max7r./(q0). 

7re5 

Proof that (iv) =^ (i). If q*^ = then the result is trivial. Otherwise, 
let r = AVF(q''). By Lemma 5.8, r = [q" + t{X — <t)]+ for some t > and 
cr € (5). Consider the value of L(-) along the trajectory from q*^ to r 

" - ={X-a)- /(qO) relying on /(O) = 

u=0 



^^-L([<l» + (A-,H 



( maxTT • /(q°) ) - <t • /(q°) by part (iv) 



> because cr G {S). 

By convexity of L, L{r) > L(q'^), and q^ is obviously feasible for ALGD(q''), 
but we chose r to be optimal for ALGD(q'^), and the optimum is unique. 
Therefore q° = AW{q°). □ 

Theorem 5.4(v) is given by the following lemma. Recall that we are using 
the norm |x| =max„|xn|. 

Lemma 5.12. Given A G A, for any e > there exists an Hf, such that for 
every fluid model solution with arrival rate X, for which |q(0)| < 1, |q(t) — 
AW(q(t))| <e for allt>He. 
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Proof. The proof is inspired by Kelly and Williams [19], Theorem 5.2, 
Lemma 6.3. We start with some definitions. Let 

p = {q e : L(q) < L(l)} for L(-) as in Definition 5.3; 

X={qGP:AM^(q) =q}; 

= {q G P : |q — r| < 6 for some r E I}; 

J-, = {qGM+:|q-AVF(q)|<e}; 

/C5 = |qGP:L(q) -L(AW^(q)) < inf L{r) - L{AW{r))} . 

We will argue that the function K{q) = L(q) — L{AW{q)) is decreasing 
along fluid model trajectories, so once you hit ICs you stay there. We will 
then argue that X d fCs <zTs d Je for sufficiently small 5. Finally, we will 
bound the time it takes to hit /C^. 

K is decreasing. Lemma 5.5 says that for any fiuid model solution, 
L(q(-)) is decreasing. From Lemma 5.10, the feasible set for ALGD(q(n)) 
is a subset of the feasible set for ALGD(q(t)) for any u > t > 0, hence 
AW{q{u)) > AW{q{t)), that is, AW{c[{-)) is increasing. Therefore K is 
decreasing (not necessarily strictly). 

X (ZlC^ <ZX^ <Z Js- To show X cK-s- the map AW is continuous by The- 
orem 5.4(ii), and L(-) is clearly continuous, so K{-) is continuous; also the 
set V is compact by Theorem 5.4(i), and Xs is open, so 'D\Xs is compact; 
so the infimum in the definition of ICs is attained at some r & T>\Xs. Now, 
K{q) > for q G P \ I, so K{r) > 0. Yet K{q) = for q G X. Thus X C ICg. 

It is clear by construction that JCs CX^. 

To show Xs C Js- the map Al/^(-) is continuous, hence it is uniformly con- 
tinuous on the compact set D, so for any e > there exists a 5 > such that 

|q-r|<5 =^ \AW{q)-AW{r)\<e/2 forq,rGP. 
If q G X^ , then it is within 6 of some r G X, hence 

|q - AVF(q)| < |q - r| + |r - AVF(r)| + | AVF(r) - AW^(q)| 

<5 + + e/2 

< £ for 6 sufficiently small. 

Time to hit ICs- Consider first the rate of change of K{-) while the pro- 
cess is in D \ ICs 

k{q{t)) < L(q(t)) = A • /(q(t)) - maxTT • /(q(t)) 

ttGo 

(42) < sup A • /(r) — maxTT • /(r 



■n-eS 



< by Lemma 5.5. 

The supremum in (42) is of a continuous function of r, taken over a compact 
set; hence the supremum is attained at some r G X* \ /C^. If the supremum 
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were equal to 0, then A • /(r) = max^r vr • /(f), so r € X by Lemma 5.11; but 
f G P \ /C5, and we just proved that Z (ZK-s; hence the supremum is some 
< 0. 

Now consider any fluid model solution starting at q(0) with |q(0)| < 1. 
If q(0) G ICsi then it remains in so the theorem holds trivially. If not, 
then q(0) < 1 componentwise, so L(q(0)) < so q(0) € P; also L(q(t)) 

is decreasing so q(t) G T) for all t > 0. Now, K{c[{t)) < —rjs all the time that 
q(t) £ 'D\ fCs, and this cannot go on for longer than = ii'(q(0))/% < 
L{l)/V5- □ 

6. Fluid model behavior (multi-hop case). In this section we describe 
properties of fluid model solutions for a multi-hop switched network running 
MW-f back-pressure, as described in Section 2. 

Let R be the routing matrix and R= {I — R'^)~^; recall that Rmn = 1 if 
work injected at queue n eventually passes through m, and otherwise. For 
a vector x E M^, let x = i?x: for arrival rate vector A, A„ is the total arrival 
rate of work destined to pass through queue n; for a queue size vector q, Ijn 
is the total amount of work at queue n and queues upstream of n. 

The set A, the PRIMAL(-) and DUAL(-) problems, the set S* of virtual 
resources, and H(-) are defined as in the single-hop case. The difference 
is that we will require A G A, and we will define the set of critically loaded 
virtual resources to be H(A). We also need to modify the definition of ALGD 
and the lifting map. 

Definition 6.1 (Lifting map). With L:M^— as in the single-hop 
case, define the optimization problem ALGD(q) to be 

minimize L{r) over r G 

such that ^ • f > ^ • q for all ^ G H(A) and 

^ Qn for all n such that A„ = 0. 

Note that L is strictly convex and increasing componentwise, and the feasible 
region is convex; hence this problem has a unique optimizer. Define the lifting 
map AW — )• by setting A14^(q) to be the optimizer. 

The main result of this section is the following. Throughout this section 
we are considering a multi-hop network with arrival rate vector A > such 
that A G A, running MW-/ back-pressure. 

Theorem 6.2 (Portmanteau theorem, multi-hop version). The state- 
ments of Theorem 5.4 parts (i)-(v) hold, for multi-hop fluid model solutions 
and using the multi-hop definition of AW. 

Some of the proofs for the single-hop case carry through to the multi- 
hop case. Other proofs rely on the fact that for single-hop networks, A G 
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A =^ X < a for some a € (5), and these proofs require modification. We 
will modify them to use the following result. 

Lemma 6.3. Under Assumption 2.3, if cr £ (S) and a' G is such that 
a' < a, then a' € {S). 

Proof. It is sufficient to establish the result for the case when a' differs 
from a in only one component, as the repeated application of this will yield 
the full result. Without loss of generality, assume the queues are numbered 
such that < (T^ < o"! and a'^ = (T„ for n>2. Since cr S (5) there is a collec- 
tion of positive constants (a7r)7re5 such that X^^a-n- = 1 and a = Xl^aTrTi'. 
By Assumption 2.3, if tt G 5, then tt' € 5 where 

vr' = I °' '^^ri = l, 
" \ vr^, otherwise, 

thus a" G (S) where a" = X^^a-n-Tr'. By construction, a'l = and a'l^ = an 
for n > 2. By choosing the appropriate convex combination 

cr' = (1 - x)a" + xa with x = cr[/ai G [0, 1] 

we see cr' £ (S) . □ 

Now we proceed toward establishing Theorem 6.2. The proof of the first 
claim of Theorem 6.2(i) is just as for the single-hop case. The second claim 
follows from the following lemma. 



Lemma 6.4. For all f\£ 



pN 



(43) A-/(q)-max7r-(/-i?)/(q)<0. 

7re5 

Also, every fluid model solution satisfies 

^L(q(t)) = A • /(q(t)) - maxTT • (I - /Z)/(q(t)) < 0. 

Proof. Since A G A, RX < a componentwise for some cr G (S). Because 
R>0 and A > 0, RX > componentwise. By Lemma 6.3, A = (/ — R^)a' 
for some cr' G (5). Hence 

A • /(q) - maxTT • (/ - fl)/(q) = a' ■ {I - i?)/(q) - maxvr • (/ - R)f{q) < 0. 

For the claim about fluid model solutions, 

|L(q(t))=q(t)./(q(t)) 



A - (/ - R'^) 



Y^s„(t)7v + y{t) 



•/(qW) 
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by differentiating (21) 

= A • /(q(t)) - ^At)^ ■ {I - ^)/(qW) + y W • {I - R)f{^{t))- 

TT 

For tlie middle term, 

s^t)^ ■ {I - R)f{^{t)) = maxp • (/ - R)f{ci{t)) s^t) by (22) 

= maxp-(/-ii)/(q(t)) by (15). 
p 

For tlie last term, we claim that 

(44) y(t).(/-i?)/(q(t))=0. 

To see this, consider first a queue n with [(/ — i?)/(q(f))]„ > 0. As noted 
in (10), this implies f{qn{t)) > f{[Rq{t)]n)- By Assumption 2.2 it must be 
that qn{t) > 0, hence yn{t) = by (19). Second, consider a queue n with 
[(/ — i?)/(q(t))]„ < 0. It must be that all of the active schedules do not serve 
this queue, that is, S7r(t) > =^ 7r„ = 0, since otherwise by Assumption 2.3 
there is another schedule that has bigger weight than tt, contradicting (22). 
Third, if [(/ - R)fiq{t))U = then obviously y„(t)[(/ - i?)/(q(t))]n = 0. 
Putting these three together proves (44). 

Putting together these findings for the middle and last terms, 

^L(q(t)) = A • /(q(t)) - maxp • (/ - i?)/(q(t)). 
at p 

Applying (43) this is < 0. □ 

The proof of Theorem 6.2(ii) is broadly similar to the single-hop case. 
Lemma 5.7, but the formulae all have to be adjusted to deal with multi-hop. 

Lemma 6.5. AW is continuous. 

Proof. If H(A) is empty, then the lifting map is trivial, and the result is 
trivial. In what follows, we shall assume that H(A) is nonempty, and we will 
abbreviate it to H. Furthermore note that for every ^ G H we know ^ • A = 1 
by definition of H, and hence there is some queue n such that > and 

An>0. 

Pick any sequence q'"' — t- q, and let r'^ = AVK(q'') and r = AM^(q). We 
want to prove that r'^ — t- r. We shall first prove that there is a compact set 
[0,/i]'^ such that r'"' G [0,/i]^ for all k. We shall then prove that any conver- 
gent subsequence of r*^ converges to r; this establishes continuity of AW. 

First — compactness. A suitable value for h is 

n = max max sup — - — . 
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Note that the maximums are over a nonempty set, as noted at the beginning 
of the proof. Note also that h is finite because q is finite. Now, suppose that 
r'^ ^ [0, h]^ for some k, that is, that there is some queue n for which r,^ > h, 
and let r' = r*^ in every coordinate except for r'^ = h. We claim that r' satis- 
fies the two constraints of ALGD(q''). To see that it satisfies the second con- 
straint, note that r' < r'^, and hence if A.„ = 0, then < < . To see that 
it satisfies the first constraint, pick any ^ G H. Either = for all queues m 
that are downstream of n, that is, for which Rmn = 1; if this is so, then 

^r' = ^•r^ + ^(r'-r'=)=^•r'= + J;(r^-rf)^e™,i^m/=|•?^ 

I m 

Or > for some queue m that is downstream of n; if this is so, then 

^ • r ' > > imh > ^ • q*" by construction of h. 

Applying this repeatedly, if r'^ ^ [0, /i]^, then we can reduce it to a queue 
size vector in [0, /i]^, thereby improving on L{r^), yet still meeting the con- 
straints of ALGD(q'^); this contradicts the optimality of r'^. Hence r'^ G 

[o,hr. 

Next — convergence on subsequences. With a slight abuse of notation, let 
a convergent subsequence, and recall that AVF(q) = r 
and q*^ — t- q. By continuity of the constraints of ALGD, s is feasible for 
ALGD(q); we shall next show that L{s) < L{r). Since r is the unique opti- 
mum, it must be that s = r. 

It remains to show that L{s) < L{r). We will construct a sequence r — 
^fc _(_^fcp q£ candidate solutions to ALGD(q*^), choosing ^'^ > and e'^P > 
to ensure that the candidate solutions are feasible. Specifically, we define 

xfc ^ / 0, if Xn > 0, 

" \iqn-&, ifA„ = 0, 

and Pn = 1^ ^Q, and 

k (^q'-^q)++g•f 

e = max . 

|GB ^ . p 

We will first deal with the feasibility constraint that pertains when A„ = 0. 
Note that this implies Am = for all queues m that are upstream of n, since 
Xn = Ylm^rnXm-, and heucc that Am = for all upstream queues. Using 
this we find 

[i?(r-5'^ + e'=P)]n 

m 
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= ^ Rnm{rm " (^m " 9^)"^) since Xm = when Rnm = 1 

m 




> for all m 



= rn- {qn - Qn)^ 

^Qn — (Qu — (^)~^ since r is feasible for ALGD(q) 

Hence r — + e^P satisfies the second feasibility constraint of ALGD(q^). 
For the other feasibility constraint of ALGD(q'^), pick any ^ G H. Then 

^ ■ R{r - S'' + e^P) 

-*k -*k 

>^-(r — 5) + (^-q' — ^•q)^+^-<5 by construction of 

> ^ • q — • q'"' — ^ • q)"*" since r is feasible for ALGD(q) 

= max(^-q,$-q^) >,t-q^ 

Since the candidates are feasible solutions to ALGD(q'''), and r*"' is an opti- 
mal solution, it must be that 

L{r^)<L{T-5^ + e''P). 

Taking the limit as /c — t- oo , and noting that L is continuous and 6^ ^0 and 
£^ — )■ 0, we find 

L(s) < L(r) 
as required. This completes the proof. □ 

For the proof of Theorem 6.2(iii), it is useful to work with a different 
representation of AVF, provided by the following lemma, which draws on 
monotonicity of S. 

Lemma 6.6. For any qG AW(q) can be written 

Aiy(q) = q + t(A - (/ - R^)a) for some i > 0, cr G (5). 

Proof, we will choose a simply by multiplying each side of the desired 
equation by R 

r = q+t(A — cr) where r = AP^(q) 
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or, rearranging, 

cr = A-(r-q)A. 

We will show that < cr < p for some p G (S), hence by Lemma 6.3 cr G (S). 

First, we show < cr. If A„ > this can be achieved by choosing t suffi- 
ciently large. If = 0, then by the second constraint of ALGD(q) we know 
that fn < Qn so an > 0. 

Second, we show <t • ^ < 1 for all ^ that are feasible for DUAL(A). Either 

^ • A = 1, in which case ^ G (H(A)) and so by the first constraint of ALGD 

we know that r • ^ > q • ^. Or ^ • A < 1, in which case we simply need to 
choose t sufficiently large. Either way, o" • ^ < 1 for all dual-feasible ^, hence 
DUAL(cr) < 1, hence PRIMAL(<t) < 1, hence ct < p for some p G (5) by the 
definition of PRIMAL(cr). □ 

The proof of Theorem 6.2(iii) is given by the following lemma. This proof 
is similar to the single-hop case, Lemma 5.9, but it is much shorter because 
the monotonicity assumption gives us a stronger representation of the lifting 
map. Lemma 6.6. Also, this version makes a weaker claim, namely that 
the lifting map is scale-invariant at invariant states, whereas the single-hop 
version shows that the lifting map is invariant everywhere. 

Lemma 6.7 ( Scale-invar iance of the lifting map). // q = ATy(q) then 
Kq = AW{Kq) for all k>0. 

Proof. Suppose that q = AVF(q), and let Kr = AW{Kq). Clearly ^q 
is feasible for ALGD(Kq); we shall show that L{k,t) > L(Kq), whence Kq is 
also optimal for ALGD(K;q), whence nq = kt by uniqueness of the optimum. 

It remains to prove that L[k,t) > L{Kq). By Lemma 6.6, we can write kf as 

Kr = Kq + t{X - (I - R^)ct) 

for some t>0 and some cr G S. Now consider the value of L along a straight- 
line trajectory from Kq to Kr 

-^L(Kq + (A - (/ - R^)(7)u) 

= {X-iI-R')cr).f{Kq) 
= \-f{Kq)-a-iI-R)f{Kq) 

> A • /(Kq) — max p ■ {I — R)f{Kq) for any a G (5) 

p&S 

= 0. 

The final equality is because 

A • /(q) = A • (/ - R)f{q) = maxTT • (/ - R)f{q) 
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by Lemma 6.9(iv) below (the proof of which does not assume the result of 
this lemma). Hence 

A • /(/tq) = A • (/ - fi)/(Kq) = maxTT • (/ - /?)/(Kq) 

using Assumption 2.2 and the fact that A G (5) by Lemma 6.3. □ 

The proof of Theorem 6.2(iv) relies on the following lemma. 

Lemma 6.8 (Fluid model trajectories preserve ALGD feasibility). Con- 
sider any fluid model solution, for any scheduling policy, with initial queue 
size q(0). Then q(t) is feasible for ALGD(q(0)) for all t > 0. 

Proof. Feasibility for ALGD(q(0)) has two parts. For the first part, 
pick any critically loaded virtual resource ^ G ^(A), and multiply each side 
of (21) by ^ = (/ - R^)-'^ and then by ^ to get 

$ ■ q{t) = ^ . q(0) + ^ • R^{t) - ^ • s„{t)7V - yit)] . 

Defining cr{t) = '^7vs.„{t)/t, which is in (S) by (15), 
^ • q{t) > ^ ■ q(0) + 1(^ • A - ^ • a{t)) by (14) and because y(t) > 
= ^-q(0)+t(l-^ •o-(t)) since ^GS(A) 
^ ^ ■ q(0) + t(l — 1) since ^ is a virtual resource and a £ {S) 

= ^q(o) 

as required for the first part of ALGD-feasibility. For the second part, sup- 
pose that A„ = for some queue n. Multiply each side of (21) by R to get 

q(t) = q(0) + At - ^ 7TS„{t) + y(t) < q(0) + At, 

TT 

where the inequality is by (16). Since we assumed = 0, qn{t) < g*«(0). This 
completes the proof that q(t) is feasible for ALGD(q(0)). □ 

The proof of Theorem 6.2(iv) is implied by parts (i) and (ii) of the fol- 
lowing lemma. 

Lemma 6.9 (Characterization of invariant states of MW-/ backpressure). 
The following are equivalent, for G : 

(i) qO = AVr(qO); 

(ii) q'' is an invariant state; 

(iii) there exists a fluid model solution with q(t) = q" for all t; 

(iv) A • /(qO) = max„e5 • (/ - i?)/(qO) . 
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Proof. That (i) (ii) (iii) =^ (iv) is proved in the same way as 
in the single-hop case. We just need to appeal to Lemma 6.4 rather than 5.5 
for the fact that L(q(t)) is decreasing, and to Lemma 6.8 rather than 5.10 
for the fact that q(t) remains feasible. 

Proof that (iv) =^ (i). Let r = AW'(q°). By Lemma 6.6, r = qO + t(A - 
(/ — Ej)a) for some t>0 and cr € (5) . By considering the value of -L(-) along 
the trajectory from q'' to r, and using (iv), we conclude that L(r) > L{qP). 
By the same argument as in the single-hop case, q*' = Al^(q'^). □ 

The proof of Theorem 6.2(v) is given by the following lemma. 

Lemma 6.10. Given A G A, for any e > there exists an > such 
that for every fluid model solution with arrival rate X, for which |q(0)| < 1, 
|q(t) - AW{q{t))\ < e for all t>He. 

Proof. The proof of Lemma 5.12 goes through almost verbatim. The 
only changes are in the penultimate paragraph, which should be replaced by 
the following: 

Time to hit /C5. Consider first the rate of change of K{-) while the pro- 
cess is in 2? \ K-s 

k{q{t)) < L(q(t)) = A • /(q(t)) - maxTT • (/ - i?)/(q(i)) 

ttGo 



< 



sup 



A-/(r)-max7r-(/-i?)/(r) 



reV\ICs ^^'^ 

< by Lemma 6.4. 

This supremum is of a continuous function of r, taken over a closed and 
bounded set, hence the supremum is attained at some f € P \ /C5. If the 
supremum were equal to 0, then A • /(r) = max,r7r • (/ — R)f{r) so f G X 
by Lemma 6.9; but r £ T> \ ICs and we just proved that I C K-s; hence the 
supremum is some —r]s<0. □ 

7. Multiplicative state-space collapse. This section establishes multiplica- 
tive state space collapse of queue size. It shows that under the MW-/ policy, 
and with suitable initial conditions when the network is not overloaded (i.e., 
when A € A), the appropriately normalized queue size vector is constrained 
to lie in or close to the set of invariant states 

X={qGM^:q= AVF(q)}. 

We assume that arrivals satisfy Assumption 2.5, and let the arrival rate 
vector A be as specified in that assumption. The function AW depends on A 
and /, as specified in Sections 5 and 6 for single- hop and multi-hop networks, 
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respectively, and the interesting case is where A G dA (since otherwise AW 
is trivial). 

This section mostly follows the method developed by Bramson [3], except 
that our proof avoids the need for regenerative assumptions on the arrival 
process by imposing slightly tighter bounds on the uniformity of their con- 
vergence, as expressed by Assumption 2.5. 

Consider a sequence of systems of the type described in Section 2.1 run- 
ning a scheduling policy of the type described in Section 2.2. Let the systems 
all have the same number of queues A^, the same set of allowed schedules S, 
the same routing matrix R and the same scheduling policy. Let the sequence 
of systems be indexed by r G N. Write 

X'-(r) = (Q^(t), A'-(r), Z^(r), 5^(t)), r G Z+, 

for the rth system. Define the scaled system x^{t) = {q^ (t) , sJ' (t) , z"^ (t) , (t)) 
for t G M+ by 

c{{t) = Q'^{rh)/r, a'-(t) = A'^{rh)/r, 

z^{t) = Z^{rH)/r, sl{t) = Sl{rH)/r 

after extending the domain of X'^{-) to ]R_|_ by linear interpolation in each 
interval (r, r -|- 1). Note that each sample path of a scaled system x^(i) over 
the interval t G [0,r] lies in C^T) with I = ^N + \S\. T > will be fixed 
for the remainder of this section. Recall the norm = supo<t<2'|2;(t)|. The 
main result of this paper is the following. 

Theorem 7.1 (Multiplicative state-space collapse). Consider a sequence 
of (single-hop or multi-hop) switched networks indexed by r £ N, operating 
under the MW-f policy (with f satisfying Assumptions 2.1 or 2.2 and S 
with Assumption 2.3), as described above. Assume that the arrival processes 
satisfy Assumption 2.5 with A G A. Also assume that the initial queue sizes 
are nonrandom, and satisfy Y\mj.^ooqf {{)) = qo for some qo G T. Then for 
any 5 > 

V llq'lOllvi / 

Simulations suggest that a stronger result holds in the widely-studied 
diffusion or heavy traffic scaling. A'' = A — T/r for some nontrivial T G 
and A G 3A. We conjecture the following. 

Conjecture 7.2. Under the assumptions of Theorem 7.1 and the ad- 
ditional assumption that increments in the arrival process are i.i.d. and uni- 
formly bounded, under the diffusion scaling for any 6 >Q 

(46) P(||q'^(-)-AVF(q'^(-))|| <(5)^1 asr^oo. 
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Fig. 2. Splitting the process into flmd-scaled parts, starting at 0, r, 2r, . . . . 

7.1. Outline of the proof of Theorem 7.1. The outline of the proof of 
Theorem 7.1 is as follows. We are interested in the dynamics of q^(t) over 
t G [0,T], that is, of Q''(t) over r G [0, r^T]. We will split this time interval 
into [rTj + 1 pieces starting at 0, r, 2r, . . . , and look at each piece under 

a fluid scaling. We will define a "good event" Er under which the arrivals in 
all of the pieces are well behaved (Section 7.1.1). We then apply Theorem 4.3 
to deduce that, under this event, the queue size process in each of the pieces 
can be (uniformly) approximated by a fluid model solution (Lemma 7.3). We 
then use the properties of the fluid model solution stated in Theorem 5.4 
to show that in each of the pieces, the queue size is (uniformly) close to 
the set of invariant states (Lemmas 7.4 and 7.5). Figure 2 depicts the idea. 
Finally we show that P(-E'r) 1 (Lemma 7.6). The formal proof is given in 
Section 7.1.2. 

Note that Lemmas 7.3-7.5 are all sample path-wise results that hold for 
every uj € Er, and so questions of independence etc. do not arise. The only 
part of the proof where probability comes in is Lemma 7.6. 

The proof is written out for a single-hop switched network. For the multi- 
hop case, the argument holds verbatim; simply replace all references to the 
single-hop fluid limit Theorem 4.3 by references to the equivalent multi-hop 
result, and replace all references to the description of single-hop fluid model 
solutions in Theorem 5.4 by references to the multi-hop version Theorem 6.2. 

7.1.1. The good event and the fluid- scaled pieces. Define the fluid-scaled 
pieces 5;^'™'^(tt) = (q^''"'^(u),a'''™'^(ii),y'''"^'^(u),s''''^'^(M)) of the original pro- 
cess by 

q™(^/) = Q^(rm + ztx)/z, 

a'"'"^'^(ti) = (A''(rm + zu) - A'^{rm))/z, 
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yr,^,^(^u) = (Y^(rm + zu) - Y^(rm))/z, 

for < m < [rTj , z > r and n > 0. Here r indicates which process we are con- 
sidering, m indicates the piece and z indicates the fluid-scahng parameter. 
The scahng parameter z^.m = |Q^(rm)| Vr is particularly important, and for 
convenience we will define x^'"^{u) = (q^'™(ti), a^'''"(n), y''''"(n), by 

q^''" (u) = q'-'™'^'-.- (u) , a'^'" (u) = a'^'™'^'->- (n) , 
The good event is defined to be 



(47) 



Er = \ sup |a^'™'"''-.fc (u) - A^-ul < r]r for all < m < [rTj 
and < A; < [Lr logrj , where w^^k = ''(I + A;/logr)|. 



By this, we mean that E,. is a subset of the sample space for the rth system, 
and we write x'"'™''"''''*^ (•)(w) etc. for cj E Er when we wish to emphasize the 
dependence on Er • The constants here are T^'^''^ = (2 + A"'''^ + TVS"™^^) (i?^ + 
1), X^^^ = sup^jA'^'l, C > is chosen as specified in Section 7.1.2 below, 
H(; is chosen as in Theorem 5.4(v), L = 1 + r(l + A™^^ + iV5'^''^), 5™^^ = 
max7re5 |7r| and the sequence of deviation terms rjr G [0, 1] is chosen as spec- 
ified in Lemma 7.6 such that ry^. — )• as r — )■ oo . 

Lemma 7.3. Let FMS be the set of fluid model solutions over time hori- 
zon [0,r^"^'^] for arrival rate vector A, and let FMS(qo) and FMSi be as 
specified in Definition 4-1- Then 

(48) sup max (i(5'^''"(-)(w), FMSi) ^ as r ^ oo 

^6^^0<m<LrTJ 

and 

(49) sup (i(5'^'°(-)(a;),FMS(qo))^0 as r — >• oo , 

where qo = qo/(|qo| V 1). 

Proof. The proof of each equation will use Theorem 4.3. We start 
with (48). The theorem requires the use of an index j in some totally or- 
dered countable set; here we shall use the pair j = (r, m) ordered lexico- 
graphically, where r € N and < m < [rTj . Lexicographic ordering means 
(r, m) > (r', m') iff either r > r' or both r = r' and m>m'. Note that j — )• oo 
implies r — t- oo (and vice versa) . 

To apply the theorem, we first need to pick constants. Let K = 1, let A 
and A-^ = A'' — )• A as per Assumption 2.5 and let Ej =rjr{l + 1/logr) so that 
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£j —7- 0. Thus condition (23) of Theorem 4.3 is satisfied. Now let 

It is worth stressing that Gj = Gr,m is a set of sample paths and associated 
scaling parameters, not a probabilistic event, and so any questions about 
the lack of independence between x^''"(-)(w) and Zr^mi^) are void. Note also 
that although the events Er lie in different probability spaces for each r, this 
has no bearing on the definition of Gj nor on the application of Theorem 4.3. 

We next show that Gj satisfies conditions (24)-(26) of Theorem 4.3, for j 
sufficiently large. Equation (24) follows straightforwardly from the fact that 
Zr,m > r, hence mf{z : {x, z) G Gr,m} > f, hence mi{z : {x, z) G Gj} — >■ oo. 
For (25), later in the proof we will establish that, under Ej. for r large 
enough, 

(50) sup \a'''"'{t)- yt\<7]r(l + -^] for aU 0<m< [rTj, 



which implies that for all {x,z) G Gj = Gj-^m, supjgjg j^flmdjla-^Xt) — XH\ < Sj 
as required. Equation (26) follows straightforwardly from the scaling used 

to define q-'(O) = q'^''™(0): for every oj, not merely uj G Er, 



|q^'-(0)| 



Q^(rm) 




Q'"(rm) 






Q'"(r?Ti) V r 



< 1. 



Since Gj satisfies the conditions of Theorem 4.3 for sufficiently large j, 
we can apply that theorem to deduce 

sup d(x, FMSi) — > as j — )■ oo. 

(£,2)GGj 

Rewriting j as (r,m), and turning the limit statement into a limsup state- 
ment, 

sup sup (i(x, FMSi) — > as r — > oo 

(r',m)>(r,0) {x,z)eG,,^^ 

and in particular 

max sup (i(x, FMSi) — 7- asr— t-oo. 

0<m<LrTJ (x,z)€Gr,,n 

Rewriting {x,z) G Gr.m in terms of a; G Er, as per the definition of Gr,m, 
max sup (i(x''™(-)(t^),FMSi) ^0 asr^oo. 

0<m<[rTJ ^fp 

Interchanging the max and the sup gives (48). 

To establish (49), we will again apply Theorem 4.3 but this time using 
the index j = r. A-' = A'' — t- A and Sj = r]r{l + 1/logr) as above, and define 

Gj =Gr = {(5'-'°(-)(^), ^r,0(^)) G Er}. 
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Equations (23)-(26) hold just as before. For (28), we will use qo as in the 
statement of this lemma, and e'j = |q'^'''(0) — qo|. This is a well-defined con- 
stant (i.e., it does not depend on the randomness w), because we assumed 
in Theorem 7.1 that the initial queue sizes Q''(0) are nonrandom, and by 
definition q'''°(0) = Q''(0)/(|Q''(0)| V r). Furthermore, Theorem 7.1 assumes 
q''(0) — )■ qo, which implies q'''''(0) — )• qo hence e'^ — )• 0. Equation (28) then 

follows straightforwardly, for every uj not merely lo £ E,.. Applying Theo- 
rem 4.3, we deduce that 



Equivalently, 



sup (i(x, FMS(qo)) — )• asj— )'00. 



sup d(F'°(-)(w),FMS(qo))^0 asr^cx) 



as required. 

To complete the proof of Lemma 7.3, the only remaining claim that needs 
to be established is (50). We will proceed in two steps. First we prove that 
I (rm) I < Lr^ under Er , for r sufficiently large and for all < m < [rTj . 
To see this, note from (7) that 

Q''(rm) < Q''(0) + A'^(rm) + NrmS'^'^. 

Now Er gives a suitable bound on arrivals: for all <m' < [rT\ , and using 
the fact that 1 < T^''''^, 

{rm! -\- r) — [rm!) 



and by applying this from m' = to m' = m — 1 we find | A''(rm)| < rm[X^^^ 
rjr). The assumptions of Theorem 7.1 tell us that Q'"(0)/r — >• qo for some 
qo G • Putting all this together, we find that for sufficiently large r 

|Q''(rm)| < r2(l + T{1 + A"'^^ + AS""^^)) = Lr^ for any < m < [rTj . 

Now we proceed to prove (50), under Er for r sufficiently large. Observe that 
(for r > 2) there exists k £ {I, . . . , [Lr logrj} such that Wr,k-i < -^r.m < Wr,k] 
this follows from r < Zr^m = \ Q,^ (rm) | V r < Lr^ and the definition of Wr^k 
in (47). Hence for any t G [O.r'l^"'^], 



A^'{rm + Zr^rnt) — AT [rm) 



\a'''^''"-'^{u)-yu 
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Wrk " 



< rjr since holds and u<t < T^^^'^ 



Wr k 

< r]r '■ since Zr,m > Wr,k-1 

Wr,k-1 

= Vr(l + l 7^-1 )<Vr(l+ ^ 



k — 1 + log r J \ log r ^ 
This establishes (50) and completes the proof. □ 

Lemma 7.4 (Choice of approximating piece). Given t G [0, T] and r G N, 
define m* = m*{r,t) and u* = u*{r,t) by 

m = mm{m £ : rm < r t < rm + I Zr^mf, u = • 

This is a sound definition (i.e., the set for m* is nonempty). Further, under 
event Er, either m* = and <u* < T^"id^ or < m* < [rT\ and < 

u* < rfl'^id^ 

Proof. The set for m* is nonempty because z^.m ^ f and > l. The 
upper bound for m* is trivial. The upper bound for u* in either case is trivial. 
To prove the lower bound for u* when rn* > 0, r'^t > r{rn* — 1) +T'^'^^'^2:j. „i*— i 
due to the minimality of m* . Hence 

_ r^t — rm* ^ T^"^'^Zr,m*-l — r ^ rpRmd ^r^m.*-! _ ^ 
Zr,m* Zf^rri* ^r,m* 

To bound Zr^m*-i/ Zr,m* , we can use (7) and the bound on a''''"'''^'''0(l) pro- 
vided by Er to show that for any m 
Zr,m = |Q''(rm)| Vr 

< {\Q''{rm - r)| + r(A'"^^ + r]r + iV5"^^^)) V r 

< \Q''{rm - r)| V r + r(A"'^^ + 1 + iV5"'^^) since i]r < 1 

< Zr,m-1 (2 + A'^^^ + iV5"^^^) since Zr,m-1 > r. 

Substituting this back into the earlier bound for u* , 

T-ifiuid 

U* > - 1 

2 + A™^^ + iVS'™^^ 

and this is equal to by choice of t'^^"'^. □ 

Lemma 7.5 (Pathwise multiplicative state space collapse). Let < C < 1, 
t £ [0,T] and r £ 'N be given. Suppose there exist m G {0, . . . , [rTj}, u G 
[0, r'''^''^] and X G FMS such that r'^t = rm + Zr^mU and \\x^'"^ — x\\ < ^, and 
furthermore either (i) ?ti > and u > and x G FMSi, or (ii) m = and 
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X G FMS(qo) where qo is as defined in Lemma 7.3. Then 

where mC(^(AVF) is the modulus of continuity of the map qi— t- AVF(q) over 

P = {q' G : |q' - q| < 1 for some q suc/i that L{q) < L{1)} 
for L(-) as in Definition 5.3. 

Proof. The first inequality is trivially true because 

Zr,m |Q'^(rm)| Vr f -r^A,,. 
— — = < sup q (u) V 1. 

r r ^ue[o,T] ' 

For the second inequality, note that after unwrapping the q''(-) scaling and 
wrapping it up again in the q'"''" scaling, the middle term in (51) is 

MT = I q^'"^ (-u) - AP^^ (q'-'™ (n) ) I . 

Writing q for the queue component of x, 

MT < |q^'™(u) - q(n)| + |q(n) - AVr(q(u))| + |AW^(q(n)) - AT^(q'-''"(n))| 

= (52a) + (52b) + (52c) respectively. 
We can bound each term as follows: 

(52a) is < C since — x|| < (" by an assumption of the lemma. 

(52b) is < in the case m > 0: by the assumptions of the lemma, 
X G FMSi so |q(0)| < 1, and also u > ff^. The requirements of The- 
orem 5.4(v) are met; hence we obtain the inequality. 

is = in the case m* = 0: In this case, by assumption of the lemma 
X G FMS(qo), that is, q(0) = qo- By assumption of Theorem 7.1 
qo G X, that is, qo = Al^(qo), therefore by Theorem 5.4(iii) qo G X, 
therefore by Theorem 5.4(iv) the fluid model solution q(-) stays 
constant at qo and so (52b) = 0. 

(52c) is < mc^(AVK): by assumption of the lemma, either m > and 
X G FMSi, or m = and x G FMS(qo) where qo < 1 component- 
wise; either way q(0) < 1 componentwise, so L(q(0)) < L{1). By 
Theorem 5.4(i) L(q(ii)) < L{1) so q(ti) G T>. Furthermore, since 
II ~r,m _^ II ^ ijy assumption of the lemma, |q'''™(ii) — q(ii)| < ^ < 1 
and so q^''^{u) G T>. The inequality then follows from the definition 
of the modulus of continuity. □ 

Lemma 7.6 (The good event has high probability). Under the assump- 
tions of Theorem 7.1, P(-E'r) —^1 as r ^ oo. The deviation terms are given 
by r]r = min(l, sup^>^ T^^^'^S ^^j^nuid^) and — )• as r ^ oo. 
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Proof. By a simple union bound, and then using the fact that the 
arrival process has stationary increments, 

[rTJ [Lr log rj 



nEr)>i-J2 E 



sup \a''^"'^'"-'^{u)-X'u\>r], 

[Lr log r J 

1 - (1 + [rTJ) Pf sup |a'^'°'"''-.'=(^^) - -^'^'"1 > 



[Lr log r J 



l-(l+rr) J] 



sup 



^''(W^r,fc'") 



Wr,k 



> 



To bound this we will use Assumption 2.5, which says that 

z(logz)¥(sup-|A''(r) - AV| >(^J ^0 as 2; — 7- oo 

\t<z Z / 

uniformly in r. After extending the domain of A^' to M+ by linear interpo- 
lation in each interval (r, r + 1), and extending the domain of 6z to z G M+ 

by 6{z) = S\^z] V (^[^] , and rescaling z by T^^^'^^ 

A'-(zu) 



z{logz) F[ sup 

ne[0,Tfl"id] 



> T^^^id^^^r^flmd^ J ^ Q as Z ^ 00 



uniformly in r. In other words, for any (j) > there exists zq such that for 
all z > zq and all r, 

A''{zu) 



sup 

«G[0,Tfl"id] 



2;(logz)^ 



Now pick ro sufficiently large that tq > and sup^>f, 
which we can do since 6{z) — as z — )• 00 by Assumption 2.5. This choice 
implies that for any r > rg and z>r, 7"fluidj^^jifluid^ ^ [recall that 7]r = 
min(l,sup^>j.r'''^''^(5|^^flmd|)]. Hence, for any r > ro and z>r, 



'[^TfluidJ, 

sup 



>Vr] < 



2:(logz)2 



Applying this bound to F(£^,.), and using the facts that Wr^k > f and 
Wr,k{^ogWr,kf > r{l + k/logr){logrf, 



1 - F{Er) < 



[Lr log r 



1 + [rTJ 
r(logr)2 

1 + [rTJ 
r log r J ^—^ k + log r 



y I 

^—i I + k/ log r 

h-\-]r 



k=0 
[Lr log r 
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r log r J Ji^Q £ - 1 + log r 
l+k^JAj^g^^ [Lr\ogr\ 



r log r J \ log r — 1 

The final expression converges to (j)T as r — t- oo. Since cj) can be chosen 
arbitrarily small, P(-E'r-) — s- 1 as r — t- oo. □ 

7.1.2. Proof of Theorem 7.1. Given 6 > 0, pick C > such that 2( + 
mc^(AP^) < 5 Al where mc^(Al^) is the modulus of continuity of AW 
over the set D specified in Lemma 7.5. We can achieve the desired bound 
by making ( sufficiently small; this is because AW is continuous, hence 
uniformly continuous on compact sets, and D is compact as a consequence 
of Theorem 5.4(i), hence mc^(ATy) — )■ as C — )■ 0. With this choice of C, 
define the good sets Er and the constants T^^^'^ and as specified by (47). 

By Lemma 7.3, there exists rg such that for r > ro and for all co € Er and 
all m, (i(F''",FMSi) < C and FMS(qo)) < C, where qo is defined in 

the statement of the lemma. 

Now, pick any t S [0,T^"''^] and r > tq, and assume Er holds. Lemma 7.4 
says that we can choose m G {0, . . . , [rTj } and u G [0, T^"'*^] such that r^t = 
rm + Zr^mU, and furthermore either (i) m> and u > or (ii) m = 0. By 
Lemma 7.3, we can pick x S FMS (depending on r, t and the w) such that 
II ~r,m _ 2;|| < ^ and furthermore either (i) m > and x G FMSi or (ii) m = 
and X G FMS(qo). Then, by Lemma 7.5, 

mt)-Awmm,. 

||q^(-)||Vl 

This bound holds for every t G [0, T] and r > ro, in a sample path- wise sense, 
whenever u £ Er- 

Finally, Lemma 7.6 says that ¥{Er) 1- This completes the proof. □ 

8. An optimal policy? Our motivation for this work was Conjecture 3.1, 
which says that for an input-queued switch the performance of MW-a im- 
proves as a \ 0. We have not been able to prove this. However, under 
a condition on the arrival rate, we can show (i) that the critically- loaded 
fluid model solutions for a single-hop switched network approach optimal 
(in the sense of minimizing total amount of work in the network) as a \ 0; 
and (ii) that for an input-queued switch the set of invariant states I de- 
fined in Section 7 is sensitive to a. We speculate that these findings might 
eventually form part of a proof of a heavy traffic limit theorem supporting 
Conjecture 3.1, given that critically loaded fluid models and invariant states 
play an important role in heavy traffic theorems. 
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In this section we state the condition on the arrival rates, and give the 
results (i) and (ii). Motivated by these results, we make a conjecture about 
an optimal scheduling policy. 

Definition 8.1 (Complete loading). Consider a switched network with 
arrival rate vector A. Say that A satisfies the complete loading condition if 
A G A, and there is a convex combination of critically loaded virtual resources 
that gives equal weight to each queue; in other words if 

^— G(H(A)). 

Theorem 8.2 (Near-optimality of fluid models under complete loading). 
Consider a single-hop switched network with arrival rate vector A G A. 

(i) For any fluid model solution for the MW-a policy, 1 • q(t) < 
Ar°/(i+«)i.q(o). 

(ii) For any fluid model solution for any scheduling policy, if A satisfies 
the complete loading condition, then 1 • q(t) > 1 • q(0). 

Proof. The first claim relies on the standard result that for any x G 
and /3 > 1, 

n ^ n ^ n 

Using the Lyapunov function from Definition 5.3, 

1 • q(t) < ivi-V(i+") / g„(t)i+° J by the first inequality in (53) 

^ n ^ 

= iV°/(i+°)L(q(t))i/(^+") by definition of L(-) 

< iV°/(i+")L(q(0))^/(^+") since L(q(t)) < by Theorem 5.4(i) 

< iV"/ (^+") 1 • q(0) by the second inequality in (53) . 

The second claim is a simple consequence of Lemma 5.10. (This lemma is 
for a single-hop network. The multi-hop version, Lemma 6.8, does not have 
such a simple interpretation.) □ 

Theorem 8.3 (X is sensitive to a for an input-queued switch). Consider 
an M X M input-queued switch running MW-a, as introduced in Section 2.4- 
Let \ij be the arrival rate at the queue at input port i of packets destined for 
output port J, A G M^^^^^. Suppose that A > componentwise, and further- 
more that every input port and every output port is critically loaded, that 
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is, 

M M 

(54) Xij = 1 and Ajj = 1 for every 1 <i,j < M. 

j=i i=i 

Then X satisfies the complete loading condition, and the critically loaded 
virtual resources are 

H(A) = {r,- for alll<i<M}U {cj for all I < j < A/}, 

where rj and Cj are the row and column indicator matrices, (ri)i,j = li=j 
and (cj)ij = Define the workload map W : Rf^ by W(q) = 

• q]^g=(_\) . Denoting the invariant set hyX[a): 

(i) if w is in the relative interior o/ {l^(q) : q G M^^^*^}, then w is in 
{T^(q) : q G T[a)} for sufficiently small a > 0; 

(ii) for a 2 X 2 input-queued switch, {W{af) : q G 2^(a)} is strictly increas- 
ing as Q \ 0. 

Item (i) essentially says that W{X{a)) becomes as large as possible as 
a \ 0, except for some possible issues at the boundary. We have only been 
able to prove (ii) for a 2 x 2 switch, but we believe it holds for any M x M 
switch. The proofs are rather long, and depend on the specific structure of 
the input-queued switch, so they are left to the Appendix. 

Conjecture 3.1 claims that, for an input-queued switch, performance im- 
proves as a \ 0. Examples due to Ji, Athanasopoulou and Srikant [15] and 
Stolyar (personal communication) show that this is not true for general 
switched networks. However, Theorem 8.2 suggests that the conjecture might 
apply not just to input-queued switches but also to generalized switches un- 
der the complete loading condition; the examples of Ji, Athanasopoulou and 
Srikant [15] and Stolyar do not satisfy this condition. We therefore extend 
Conjecture 3.1 as follows. 

Conjecture 8.4. Consider a general single-hop switched network as 
described in Section 2, running MW-a. Consider the diffusion scaling limit 
(described in Conjecture 7.2), and let X be the limiting arrival rates; as- 
sume X satisfies the complete loading condition. For every a > there is 
a limiting stationary queue size distribution. The expected value of the sum 
of queue sizes under this distribution is nonincreasing as a \ 0. 

Theorem 8.2 says that MW-a approaches optimal as a \ 0, under the 
complete loading condition. It is natural to ask if there is a policy that is 
optimal, rather than just a sequence of policies that approach optimal. Given 
that MW-a chooses a schedule tt to maximize tt • q° (where the exponent 
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is taken componentwise), and since 

a _ f 1 + alogx + O(a^), if X > 0, 
~\0, if2; = 0, 

we propose the following formal limit policy, which we call MSMW-log: at 
each time step, look at all maximum-size schedules, that is, those n £ S for 
which X]„7i"nlQ„>o is maximal. Among these, pick one which has maximal 
log-weight, that is, for which Q^^^QT^n^ogQn is maximal, breaking ties 
randomly. 

Conjecture 8.5. Consider a general single-hop switched network run- 
ning MSMW-log. Consider the diffusion scaling limit, and let A be the lim- 
iting arrival rates; assume A satisfies the complete loading condition. There 
is a limiting stationary queue size distribution. This distribution minimizes 
the expected value of the sum of the queue sizes, over all scheduling policies 
for which this expected value is defined. 

Scheduling policies based on MW are computationally difficult to imple- 
ment because there are so many comparisons to be made. In future work 
we plan to investigate whether the techniques described in this paper can 
be applied to policies that may have worse performance but simpler imple- 
mentation. 

APPENDIX: RESULTS FOR INPUT-QUEUED SWITCHES 

In this section we prove Theorem 8.3. Throughout this Appendix we are 
considering a M x M input-queued switch. The set of schedules S consists 
of all M X M permutation matrices. We assume the arrival rate matrix A 
satisfies the complete loading condition (54) and that A > componentwise. 
We let the scheduling algorithm be MW-a, and define X(a) to be the set of 
invariant states. 

A.l. Identifying A, «S*, H and H"'". The Birkhoff-von Neumann decom- 
position result says that a matrix is doubly substochastic if and only if it is 
less than or equal to a convex combination of permutation matrices, which 
yields 

{MM \ 
A G [0, l]^^xA^ : Xij < 1 and ^ A,j < 1 for all i,j \ . 
j=i i=i ) 

Since A satisfies the complete loading condition (54), A G A. 

Lemma A.l below gives S* , the set of virtual resources, that is, maximal 
extreme points of the set of feasible solutions to DUAL(A). From the com- 
plete loading condition, it is clear that H(A) = S* as claimed in the theorem. 
It will also be useful, for the proof of Theorem 8.3(i), to identify H+(A) as 
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defined by (34). We claim that 

(55) S+(A) = H(A). 

To see this, suppose ^ is a nonmaximal extreme point of the set of feasible 
solutions to DUAL(A); then there exists some other extreme point C such 
that ^ < C S'lid ^ 7^ Ci but because A > componentwise it must be that 
^ • A < • A. We have found that A G A, so the solution to DUAL(A) is < 1, 
hence ^ • A < 1. Therefore $, ^ H"^(A), that is, H"''(A) consists only of maximal 
extreme points. 

Lemma A.l. The set of maximal extreme points of the set 
F = {$e rI^""^'^ : ^ • tt < 1 /or a// tt G 5} 

is given by 

S* = {rj for all l<i<M}U {cj for all l<j< M}, 

where the row and column indicator matrices and cj are defined by {^ijij = 
^i=i and i^j^ij — ^j=j' 

Proof. First we argue that every £ S* is a maximal extreme point 
of F. It is simple to check that ^ £ F. Also, ^ is extreme because F C 
[0,1]*'^^*^. Finally, ^ is maximal because if it were not then there would 
be some e > 0, e ^ 0, such that ^ + e £ F; but for any such e there is 
a matching vr such that e • vr > hence ^ + e ^ F. 

Next we argue the converse, that all maximal extreme points of F are 
in S* . The first step is to characterize the extreme points of F. We claim 
that if £ F, then it can be written C ^ ^ ^or some ^ £ {S*). Consider the 
optimization problem 

M M 

minimize + over Xi>0, yj>0 for all z, j 

(56) „ „ 
such that C < ^i^i + yjcj- 

« i 

The dual of this problem is 

maximize a • C, over a £ 

(57) 

such that a • rj < 1, a • cj < 1 for all 

[These problems are just PRIMAL((^) and DUAL(^), resp., but with respect 
to the "virtual" schedule set S* rather than the actual schedule set 5.] By 
Slater's condition, strong duality holds. Now, any matrix a that is feasible 
for (57) is nonnegative and doubly substochastic, hence by the Birkhoff- 
von Neumann decomposition result it can be written as a a < b where b 
is a convex combination of permutation matrices, that is, b £ {S). But by 
the assumption that C, £ F^ C " ^ 1 foi" 7V £ S, hence b • < 1, hence 
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a • C ^ 1) so the value of the optimization problem (57) is < 1. By strong 
duality, the value of the optimization problem (56) is also < 1. Therefore 
< ^ for some ^ G (5). 

We now claim that if C G F then it can be written = a^^ where the 
sum is over some finite collection of values drawn from the set 

E = {^e : ^ < r? or ^ < cj for some 

and all are > 0. We have just shown that 

(58) C = ~ 2; for some z > and > 0, ^ £ E. 

If z = we are done. Otherwise, pick some (A;, /) such that z^^i > and 
define n'='' by {n'''%j = 1 - 1 .^^ ^^^^ Noting that H^k,l > Zk,i > 0, we 
can rewrite ^ as 

where matrix multiplication is componentwise as per the notation specified 
in Section 1. We have thus rewritten ^ in the form (58), but now z has one 
fewer nonzero element. Continuing in this way we can remove all nonzero 
elements of z, until we are left with ^ G (E). 

We have therefore shown that all extreme points of F are in E. Clearly, 
all maximal points of E are in S* . Therefore, all the maximal extreme points 
of F are in 5* as claimed. □ 

A.2. Proof of Theorem 8.3 (i). We first state two lemmas which will be 
needed in the proof. The first is a general closure property of permutation 
matrices, and the second is a property of invariant states of MW-a. We then 
prove the theorem and the two lemmas. 

Lemma A. 2. Let y: e M^^''*^ and define A G R^^''^^ by Aij = 1 if 
there is some matching p £ S whose weight p ■ x is maximal (i.e., p • x = 
maXo-G5 cr ■ x) and for which pij = 1; and Aij = otherwise. Then, for any 
matching tv such that 

TV is itself a maximum weight matching. 

Lemma A. 3. Fix any A G A, and any q G I{a). For every 1 < i, j < M 
such that Xij > 0, there exists a matching n £ S whose weight tt • q" is 
maximal (i.e., tt • q" = maXo-G5cr • q"? where the exponent is taken compo- 
nentwise) and for which iTij = 1. 

Proof of Theorem 8.3(i). Suppose the claim of the theorem is not 
true, that is, that there exists a sequence a \ such that w ^ >V(a) = 
{M^(q) : q G X(a)} for each a in the sequence. 
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Write w = {wi., . . . ,wm-,w.i, . . . , w.m), and define the function A" : — )• 
j^A/xA/ g-^g ^YiQ (unique) solution to tlie optimization problem 

minimize V qjt" over q E ^ 

such that rj • q > Wi. and Cj • q > w.j for all 1 < j < M. 

This is the optimization problem defined in (35); we have simply written 
out H+(A) explicitly using (55). By Lemma 5.6, the map AVFiM^''*^-^ 
M^^^ that defines I{a) is simply the composition of W :R^''^ ^ Rf'^ 
and A":M2_w ^M^^''^^ Let q(a) = A°('u;). Note that q(a) EX(a); this is 
because q(a) is optimal for A°^{w), therefore it is optimal for A"(VF(q(a))) 
which has a smaller feasible region, therefore q(a) = A"(VF(q(a))) = 
AVF(q(a)), that is, q(a) EX(a). 

We next establish this claim: that for each a in the sequence, there exists i, 
j, i' and j' such that qij{a) = 0, qiij{a) > w.j/M and qi ji{a) > Wi./M. 
To prove this claim, observe that T4^(q(a)) >why the constraints of the 
optimization problem A°; and that q(a) E I{a) hence M^(q(a)) E W(a); 
hence l^(q(a)) 7^ ij; by assumption that w ^ W{a). Therefore VF(q(a)) > w 
in some component, that is, there is some i or j such that • q(Q) > Wi. or Cj ■ 
q(a) > w.j. Indeed, there must be both such an i and j, since otherwise the 
sum of row workloads and column workloads would not be equal. Therefore 
qij{a) = 0, since if qij{a) > 0, then we could reduce qij{a) and still have 
a feasible solution to the problem that defines A'^{w) but with a smaller 
value of the objective function, which contradicts optimality of q(a). There 
must also be a j' such that qiji[a) > Wi./M since otherwise the workload 
constraint rj • q(a) > Wi. would not be met. Likewise for i' . 

We assumed that w E M^*^ is in the relative interior of W™*^^ = {VF(q) : q E 
j^MxA^i^ This set is clearly convex, and from the characterization of relative 
interior for convex sets, for all x E W™*^ there exists y E W™^^ and < a < 1 
such that w = ax + {1 — a)y. In particular, by choosing x = W{1), we find 
that w > componentwise. 

For each a in the sequence we can find indices {i{a) , j (a) , i' (a) , j' (a)) as 
above. Some set of indices {i,j,i',j') must be repeated infinitely often, since 
there are only finitely many choices. Restrict attention to the subsequence 
of a for which i{a) =i, j{a) =j, i'{a) = i' , j'{a) = j' ■ 

Now, consider the submatrix 

Recall that q(a) EX(a). By Lemma A. 3, and the assumption that A > 
componentwise, every queue is involved in some maximum weight matching. 
By Lemma A. 2, every matching is a maximum weight matching. Let tt be 
any matching with TTjj = TTi'j' = 1, and let p be like tt except with {i,j) and 
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{i' flipped, that is, pij = pi'j' = and piji = pi/j = 1. We can write out 
explicitly the difference in weight between these two matchings 

P ■ q(")" - 71" • q(a)" = Qi'jiaT + - - Qi,j{<^T- 

Here, q(a)" denotes component-wise exponentiation. Recall that along the 
subsequence we have chosen, qij{a) = 0, qij'{a) > Wi./M and qi'j{a) > 
w.j/M. Therefore lim inf q,_>,o p ■ q(a)° — tt • q(a)" > 0. This contradicts the 
finding that every matching is a maximum weight matching for q(a). 

Thus, we have contradicted our original assumption that there exists a se- 
quence a \ with w ^ yV{a). This completes the proof. □ 

Proof of Lemma A. 2. Let a = max^p • x be the weight of the max- 
imum weight matching, let A = {p:p ■ x = a} and let a = YlpeAP- 
serve that cr £ Z*^^^ and Aij = 1^^. j.>o}- Therefore, cr > A componentwise. 
Further, by definition A > tt. Therefore, the matrix cr — tt is nonnegative. 
Since a is sum of |^| permutation matrices, all its row sums and column 
sums are equal to |^|. And since tt is a permutation matrix as well, the ma- 
trix cr — TV has all its row sums and column sums equal to |^| — 1; therefore 
by the Birkhoff-von Neumann decomposition 

a = TT + J2 o^pP^ 

where each > and = |^| — 1- Now, cr • x = \ A\a by construction 

of cr. Therefore 

\A\a = cr • X = TT • X + app ■ x 

pes 

< TT • x + (1^1 — l)o because a = max p • x. 

P&S 

Rearranging, tt • x > a. But a is the weight of the maximum weight matching, 
thus TT • X = a. □ 

Proof of Lemma A. 3. Since A G A, A < cr for some a £ {S), that is, 
cr = ^^g^OTrTT where ^a^- = 1 and each > 0. Therefore 

A • < cr • = a.„7v • q" < maxvr • q" = maxvr • q°. 

But by Lemma 5.11(iv), A • q" = max„ tt • q", therefore both the inequalities 
in the above must be equalities. In particular, all matchings tt for which 
a^r > are maximum weight matchings. If Xij > 0, then at least one of these 
matchings has TVij = 1. □ 

A. 3. Proof of Theorem 8.3(ii). Consider a 2 x 2 switch with arrival rate 
matrix A. Since A satisfies (54), we may write 

1 — Ai^i 
1 — Ai 1 Ai 1 
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for some Ai^i G (0,1). To find I{a) we use the characterization from Lem- 
ma 5.11(iv), which says that q € I{ct) if and only if A • = max„7v ■ q", 
that is, if and only if 

Al,l(9f,l + Qh) + (1 - Al>l)(9"2 + Q2,l) = «1 + 12,2) V (tf,2 + q2,l)- 

Now, the equation Ai^ix + (1 — ^i,i)y = xV y is satisfied if and only if x = y, 
given < Ai^i < 1. Therefore 

X(a) = {q G Rf^ : q^, + q^^ = q^^ + q^,}. 
We wish show that {M^(q) : q G is strictly increasing as a \j 0, where 

Vl^(q) = (ri -q, r2 -q, Ci -q, C2 -q). It suffices to show that VV(a) = {W^(q) : q G 
X(a)} is strictly increasing, where F[^(q) = (ri • q, Ci • q, 1 • q), since there is 
a straightforward bijection between W^(q) and Vi^(q). Now, {wi.,w.i,w..) G 
M.^ is in VV(a) iff there exists q G M^^^ such that 

Qi,i + Q2,2 = + Q2,i, ri-q = wi., Ci-q = u;.i, l-q = u;.., 
that is, iff there exists x G M such that 

(59) x" + {w.. - wi. - w.i + x)" - {wi. - x)° - {w.i - x)" = 0, 

(60) max(0, wi. + w.i — w..) < x < mm{wi.,w.i). 

Write 6{x) for the left-hand side of (59). Since 9{x) is increasing in x, there 
exists a solution to (59) iff 9{x) < at the lower bound in (60) and ^(x) > 
at the upper bound. By considering four separate cases of which of the 
bounds in (60) are tight, and after some algebra, we find that there exists 
a solution iff 

(61) Wi. + w.j + « + w'^j)^/"' > w.. for each i,j G {1,2}, 

where W2. = w.. — wi. and w.2 = w.. — w.i. Now, it is a standard inequality 
that for any x > and y > 0, and any < a < /3, 

(x" + ?/")i/° > (x^ + /)i/'^. 

Applying this inequality to (61), it follows that W(a) is strictly increasing 
as a \ 0. 
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